1 00:00:00,800 --> 00:00:02,960 Equidox by Onix  2 00:00:03,840 --> 00:00:08,560 Reach everyone. 3 00:00:08,560 --> 00:00:14,880 [Dan Tuleta]  All right, well welcome everyone to another edition of Equidox Webinar Wednesdays. It's   4 00:00:14,880 --> 00:00:17,120 just about two o'clock, so I  think we should get started.   5 00:00:18,240 --> 00:00:22,880 Now for anyone that has joined us before, we  really appreciate you continuing to attend   6 00:00:22,880 --> 00:00:30,320 these webinars to learn a bit more about document  remediation and Equidox. So today's presentation   7 00:00:30,320 --> 00:00:36,400 is going to be talking about refining and  designing a workflow for attacking these documents   8 00:00:36,400 --> 00:00:42,800 from a remediation standpoint. So as always please  feel free to reach out to us at any given time   9 00:00:42,800 --> 00:00:53,920 through EquidoxSales@onixnet.com. Our website is  www.Equidox.co and our 800 number is 800-664-9638.   10 00:00:54,640 --> 00:01:01,040 So we do encourage follow-up questions or feedback  for these webinars and if you'd like to see any   11 00:01:01,040 --> 00:01:07,680 personalized demonstrations of Equidox, maybe more  tailored to your specific workflow, please feel   12 00:01:07,680 --> 00:01:14,400 free to reach out to us. We would love to start  a conversation with you. Now before we get into   13 00:01:15,040 --> 00:01:19,600 the Equidox specifics, I just wanted to mention  a couple of quick things about our parent company   14 00:01:19,600 --> 00:01:26,160 Onix Networking. Onix Networking is primarily a  Google cloud consultancy and an AWS consultancy   15 00:01:26,160 --> 00:01:32,000 as well. Our mission is to improve organizational  efficiency through cloud computing solutions. We   16 00:01:32,000 --> 00:01:39,040 are headquartered in Cleveland, Ohio with offices  in Canada as well, and we do have quite a few   17 00:01:39,040 --> 00:01:46,400 remote employees scattered across North America.  We've been in business since 1992, so quite a long   18 00:01:46,400 --> 00:01:52,880 time in the IT space, and one of the one of the  actual divisions of Onix is of course Equidox,   19 00:01:53,520 --> 00:01:59,120 where we focus mainly on digital accessibility with a niche in PDF documents. So, we're going   20 00:01:59,120 --> 00:02:05,200 to be talking a bit about our Equidox software  which is designed for PDF remediation. And we'll   21 00:02:05,200 --> 00:02:12,320 be going through a couple of ways of looking at  various workflows for approaching PDF documents   22 00:02:12,320 --> 00:02:17,680 in the most efficient ways to remediate them.  And I’ll actually be going through a few examples   23 00:02:17,680 --> 00:02:24,160 in the Equidox software. So for anyone that has  seen a demonstration or maybe is an existing user   24 00:02:24,160 --> 00:02:30,000 of Equidox, some of this might be a little  bit of a review, from perhaps a recent demo   25 00:02:30,000 --> 00:02:33,840 or training session we might have had. But for  those of you who have not seen Equidox before,   26 00:02:33,840 --> 00:02:40,000 you'll get to get a little bit of a feel for what  PDF remediation can look like in our tool. Now   27 00:02:41,280 --> 00:02:49,040 just a quick little blurb about who we serve... some  of some of our clients: Carnival, Bose, PNC,   28 00:02:49,040 --> 00:02:56,240 Pella, NYU. We serve a number of clients across  all different verticals from large to small.   29 00:02:56,960 --> 00:03:02,080 So we do appreciate everyone joining us today  regardless of the size of the organization or   30 00:03:02,640 --> 00:03:06,640 which type of market you are actually in.  So thank you very much for attending today.  31 00:03:08,080 --> 00:03:15,120 Now in terms of document remediation, if you  are assigned with remediating documents… if   32 00:03:15,120 --> 00:03:19,920 this is your everyday job… or if this  is just a one-off project, maybe that   33 00:03:19,920 --> 00:03:25,600 that you've been asked to complete as a  content creator, there's a few things to   34 00:03:25,600 --> 00:03:31,840 consider in the pre-flight process. So before  you get started with remediating a document,   35 00:03:32,640 --> 00:03:37,920 one of the first things that I like to look at is  how many pages are there, If there are just one   36 00:03:37,920 --> 00:03:43,760 or a couple of pages in this document, you know,  then then you can kind of just power through it   37 00:03:45,120 --> 00:03:49,600 and just kind of approach it by just opening the  document up and taking a look at what you have   38 00:03:49,600 --> 00:03:55,040 and start working on it. Now if there are a lot  of pages in this document, typically I like to   39 00:03:56,320 --> 00:04:01,280 take a few minutes to just investigate what  the document is compiled of. Because you might   40 00:04:01,280 --> 00:04:07,040 choose to be a little bit more strategic and  tactical in how you approach that document.   41 00:04:07,040 --> 00:04:12,240 Because there could be a number of things that  you could consolidate… steps along the way   42 00:04:12,240 --> 00:04:17,440 that will eliminate redundancies and, you know,  forcing you to repeat steps page after page.   43 00:04:18,080 --> 00:04:22,000 So we'll take a look at a few of those  things later on during the demonstration.   44 00:04:22,960 --> 00:04:29,360 Now another thing to consider is how complex is  the content? Now a document might have a lot of   45 00:04:29,360 --> 00:04:36,800 pages, but if you consider a document maybe that's  just all text, for example, where you just have a   46 00:04:36,800 --> 00:04:42,080 series of paragraphs page after page after page…  The great thing about Equidox is its ability   47 00:04:42,080 --> 00:04:47,040 to auto-detect that type of content. And it's  going to do a lot of the heavy lifting for you.   48 00:04:47,840 --> 00:04:52,960 So the complexity of the content does have a  lot to do with your workflow as you remediate.   49 00:04:53,520 --> 00:05:00,880 Now a 100-page document made up entirely of text  elements could be easier than a 10-page document   50 00:05:01,440 --> 00:05:07,120 although it's much longer. A 10-page document  might have very technical engineering drawings, or   51 00:05:08,000 --> 00:05:15,600 very complex elements that take more actual work to tag properly. So something to look at is the   52 00:05:15,600 --> 00:05:20,400 complexity of the content. To kind of build out  the workflow in your head before you get started.   53 00:05:21,280 --> 00:05:26,080 Another thing that I look at is, is the design  and the formatting of the document consistent   54 00:05:26,080 --> 00:05:33,040 throughout? So is this document coming from  a template? Is there a consistent use of   55 00:05:33,040 --> 00:05:38,160 heading structure throughout? Are the lists  properly formatted throughout the document?   56 00:05:38,160 --> 00:05:43,440 How consistent is this page after page?  Sometimes I see documents which are almost   57 00:05:43,440 --> 00:05:50,960 like a Frankenstein document. Where it's a series  of pages that have come together from various   58 00:05:50,960 --> 00:05:56,480 locations. There's nothing consistent about them.  Every page is unique, and every page has to be   59 00:05:56,480 --> 00:06:03,360 treated like as its own individual puzzle. So  it's just important to note if there's consistent   60 00:06:03,360 --> 00:06:09,600 formatting throughout the document. There are  ways within Equidox to consolidate steps and to   61 00:06:09,600 --> 00:06:14,320 eliminate some redundancies so that you don't have  to repeat the same exact steps page after page.   62 00:06:15,920 --> 00:06:21,520 Now another thing that I look at always, is if  there is existing tag structure associated with   63 00:06:21,520 --> 00:06:28,160 the document. And existing tag structure could  be good or bad. Now many documents, depending   64 00:06:28,160 --> 00:06:34,000 on where they come from… for example, Microsoft  Word being a very common one, when you hit “Save   65 00:06:34,000 --> 00:06:40,080 as PDF” coming out of Microsoft Word oftentimes  there is an automated tagging process where   66 00:06:40,080 --> 00:06:48,560 Microsoft Word will attempt to convert that word  file into a PDF and generate some automated tags   67 00:06:48,560 --> 00:06:54,400 throughout that process. Now that can be good  or bad, depending on the way that the document   68 00:06:54,400 --> 00:06:59,280 was put together. But the point is that you don't  really know what you're going to get during that   69 00:07:00,720 --> 00:07:07,360 conversion process from Microsoft Word into PDF.  Because how those how those tags are generated   70 00:07:08,080 --> 00:07:12,960 is basically a mystery to the user. You just  have to kind of wait and see what you get. There   71 00:07:12,960 --> 00:07:17,040 are of course many things that you can take into  consideration when you're designing your documents   72 00:07:17,920 --> 00:07:22,400 to improve the accessibility of them  when you do convert them into a PDF.   73 00:07:23,040 --> 00:07:28,160 But they still need to be looked at  because PDF and Word are different formats   74 00:07:28,160 --> 00:07:32,560 and they just have different ways of marking  up documents from an accessibility standpoint.   75 00:07:33,120 --> 00:07:36,960 So just because your Microsoft Word  document is technically accessible,   76 00:07:36,960 --> 00:07:41,280 it does not mean that the PDF that that generates  is going to be automatically accessible. So   77 00:07:41,280 --> 00:07:46,800 you still need to take into consideration  that the existing tags that were generated   78 00:07:46,800 --> 00:07:52,160 automatically through Microsoft Word, they  might still need to be touched up or adjusted.   79 00:07:53,280 --> 00:07:59,840 Now there are other documents that a that may  have been remediated at some point in their   80 00:08:00,400 --> 00:08:06,800 life cycle where someone might have opened the  document in Adobe Acrobat and pressed auto tag   81 00:08:07,600 --> 00:08:11,840 and just kind of crossed their fingers and hoped  that they got some sort of usable tag structure.   82 00:08:13,040 --> 00:08:19,280 This can be something that is going to cause you more problems than be of service. So these are   83 00:08:19,280 --> 00:08:24,160 just things to look at. If you have existing tag structure. What is the quality of those   84 00:08:24,160 --> 00:08:30,240 existing tags, and are they worth maintaining,  or is it better to just use Equidox and start   85 00:08:30,240 --> 00:08:36,160 from scratch? Because of the automation and all of  the tools that Equidox gives you to tag documents,   86 00:08:36,960 --> 00:08:39,840 there are many instances where it  can be easier to just start over   87 00:08:41,120 --> 00:08:50,000 rather than trying to rescue poor existing tags.  Another feature that or another aspect that I   88 00:08:50,000 --> 00:08:55,520 like to look at is does the document require OCR?  And OCR stands for Optical Character Recognition.   89 00:08:57,040 --> 00:09:00,000 This can be relevant for documents  that are scanned, for example,   90 00:09:01,040 --> 00:09:06,400 or documents that might contain infographics  that have text that is actually locked inside   91 00:09:06,400 --> 00:09:13,760 of images. Now OCR certainly adds another  layer of complexity to remediation because   92 00:09:13,760 --> 00:09:17,760 in order to make these documents compatible  with screen readers and machine-readable,   93 00:09:18,560 --> 00:09:24,400 an image is simply an image. We need to extract  text from it if there is in fact text inside of   94 00:09:24,400 --> 00:09:29,920 it. So if you have a document where someone took  an old newspaper and ran it through a scanner,   95 00:09:30,480 --> 00:09:36,240 technically that document can come out of that  scanner as a PDF. But it is simply just an image   96 00:09:36,240 --> 00:09:42,160 as far as a machine reader is concerned. So  that this requires an extra step called Optical   97 00:09:42,160 --> 00:09:48,720 Character Recognition, where you're actually  analyzing the image and extracting the usable   98 00:09:48,720 --> 00:09:54,880 searchable, selectable text from it. So you have  to run this OCR process to extract that text,   99 00:09:54,880 --> 00:10:00,640 which can take it can take a little bit of extra  time, but this is just sort of the nature of the   100 00:10:00,640 --> 00:10:06,160 of the format. If you're dealing with scanned  images then you have to take the time to make sure   101 00:10:06,160 --> 00:10:11,600 that you are extracting the content from it in  order to make it machine readable. Another thing   102 00:10:11,600 --> 00:10:17,120 to consider is are there form fields present?  So documents PDF documents that are intended   103 00:10:17,120 --> 00:10:23,120 to be fillable forms oftentimes will have form  fields present within them. And this is another   104 00:10:23,680 --> 00:10:30,080 element that does take a little bit of extra care  because form fields need to have what are called   105 00:10:30,080 --> 00:10:34,240 Tooltips applied to them. So if you find that  there are form fields in the document, that's   106 00:10:34,240 --> 00:10:38,720 another thing to be aware of and another thing  to sort of budget your time and your workflow   107 00:10:38,720 --> 00:10:44,160 around… making sure that you that you take  the time to properly handle those form fields.   108 00:10:45,360 --> 00:10:50,880 And then one of the last things that I like to  look at are the images. So if there are images   109 00:10:50,880 --> 00:10:55,280 present in the document, which many times there  will be, how many images are there? About how   110 00:10:55,280 --> 00:11:02,160 many images per page are there? I mean, are these  images informative are they technical or are they   111 00:11:02,160 --> 00:11:07,200 something that are just decorative? Now decorative  images could be things like repetitive logos…   112 00:11:07,920 --> 00:11:17,200 they could be like watermark… or background  images… they could be stock images. So some sort   113 00:11:17,200 --> 00:11:23,440 of brochure where you just have two people bending  over a computer smiling at the computer screen…   114 00:11:24,240 --> 00:11:29,520 These are sorts of like stock images that are not  actually adding any content a contextual value   115 00:11:30,240 --> 00:11:34,960 to the document itself. They're simply there  for the aesthetics and to take up some space   116 00:11:34,960 --> 00:11:42,080 and maybe balance out the page. So not all images  require alt text, but that's up to the remediator   117 00:11:42,080 --> 00:11:47,600 to determine… If there are images, what kind  of alt text and what kind of effort do you   118 00:11:47,600 --> 00:11:53,920 have to make to apply that alt text? Perhaps the  image… the document is full of technical images.   119 00:11:53,920 --> 00:11:59,440 Maybe this is like engineering content where  you have a bunch of complex diagrams and graphs   120 00:11:59,440 --> 00:12:03,600 and charts. If you're just the remediator  and you're not the subject matter expert,   121 00:12:03,600 --> 00:12:08,080 you might need to incorporate someone who  knows more about that subject than you   122 00:12:08,080 --> 00:12:13,280 to help you with the alt text writing. So  these are just things to consider as you   123 00:12:13,280 --> 00:12:18,000 are going through your remediation and as you're  developing your workflow throughout that document.   124 00:12:20,960 --> 00:12:27,920 Now some collaboration ideas! Because Equidox is  a web-based application, there's a collaboration   125 00:12:27,920 --> 00:12:34,480 aspect built into the tool. So we can  actually work together and share documents   126 00:12:34,480 --> 00:12:40,080 through the Equidox platform. And because we work  on a concurrent user licensing model, that means   127 00:12:40,080 --> 00:12:45,440 you can have multiple people logged into the  account, into the same account simultaneously,   128 00:12:45,440 --> 00:12:49,760 and you can even have multiple people working  together on the same document simultaneously.   129 00:12:51,120 --> 00:12:55,680 So Equidox gives you the ability to share  documents with other users or groups   130 00:12:56,240 --> 00:13:01,280 and you can actually divide and conquer.  So let's say you have a 200-page document   131 00:13:01,280 --> 00:13:05,120 and it's got a very tight deadline. You know,  you'd like to get it posted by the end of the day.   132 00:13:05,680 --> 00:13:10,400 Well rather than one person having to  hunch over their computer all day and   133 00:13:10,400 --> 00:13:15,200 kind of panic about this very large dense  document, you might have five different users   134 00:13:15,200 --> 00:13:20,560 and everyone takes 40 pages. And then you can kind  of power through that document in an hour or two,   135 00:13:20,560 --> 00:13:24,960 rather than having one person have to stress out  and clear their calendar to make sure that they   136 00:13:24,960 --> 00:13:29,520 can get through it. So this is all made possible  because this is a cloud-based application.   137 00:13:30,080 --> 00:13:33,600 Being able to share the document and  even have multiple users working together   138 00:13:33,600 --> 00:13:40,880 simultaneously has a lot of advantages built  into it. As I mentioned before, images could   139 00:13:40,880 --> 00:13:46,640 require subject matter experts so depending on  your background or the nature of the document   140 00:13:48,240 --> 00:13:53,600 these images that are contained in the document  might be so complex that you're not exactly sure   141 00:13:53,600 --> 00:13:58,320 how to provide an accurate alt description. So  that again, the advantage of being web-based,   142 00:13:58,320 --> 00:14:04,160 is you could send the URL for the page over  through an instant messenger or through an email   143 00:14:04,160 --> 00:14:08,400 to one of your colleagues, or one of your  teammates who might be more familiar with the   144 00:14:08,400 --> 00:14:14,160 content, and just ask them to take a few minutes  to provide alt text for those images. And then you   145 00:14:14,160 --> 00:14:18,800 can do the rest of the tagging of the documents.  So from a structural standpoint, you can set your   146 00:14:18,800 --> 00:14:22,960 reading order, and your headings, and your lists  and your tables, but those images that require   147 00:14:22,960 --> 00:14:28,480 that alternate text you might not be comfortable  trying to provide that… If they are very technical   148 00:14:28,480 --> 00:14:33,280 or very complex images… so you can get someone  else involved in that workflow. Because this is   149 00:14:33,280 --> 00:14:37,200 cloud-based, they can be working on that alt text  while you are working on the rest of the document.   150 00:14:38,960 --> 00:14:44,960 And also in the Equidox application, there is  a validation tool, and a Page Notes feature,   151 00:14:44,960 --> 00:14:49,760 so assuming that you're working together with  a large team on a specific document, you can   152 00:14:49,760 --> 00:14:56,320 use the validation tool to essentially mark pages  as complete. So that you can avoid redundancies…   153 00:14:56,320 --> 00:15:00,800 so someone isn't coming into a page and making  adjustments and changing things that have already   154 00:15:00,800 --> 00:15:08,640 been marked as validated. And then also the Page  Notes feature is a way of just leaving yourself or   155 00:15:08,640 --> 00:15:13,200 other users a note. Perhaps you skipped over  an element because you weren't sure how to   156 00:15:13,200 --> 00:15:19,920 approach it, or you'd like to just have some  sort of communication with your colleagues   157 00:15:20,480 --> 00:15:25,520 as to what happened on that specific page, or  how you decided to approach it. Where they might   158 00:15:25,520 --> 00:15:32,400 be able to sort of replicate that process on  other similar pages. And Equidox will give you   159 00:15:32,400 --> 00:15:36,480 a consolidated list of all of these various page  notes that you take throughout the throughout the   160 00:15:36,480 --> 00:15:42,480 document. And whether you're an Administrator  or a power user, and you have less skilled less   161 00:15:42,480 --> 00:15:46,800 trained workers working underneath you, you  can very quickly look through the page notes   162 00:15:46,800 --> 00:15:50,960 and make sure that everything has been addressed.  If there were questions, or if there were issues,   163 00:15:50,960 --> 00:15:56,240 you can always just follow up in the in the  application and look at that specific page and   164 00:15:56,240 --> 00:16:00,320 make sure that that issue was addressed. And we'll  go through that as well in the demonstration.   165 00:16:02,320 --> 00:16:09,040 Now eliminating repetition. So, if we have a  lot of different tools built into Equidox that   166 00:16:09,040 --> 00:16:16,000 allow you to programmatically apply things to all  pages. So, if you are an existing Equidox user or   167 00:16:16,000 --> 00:16:20,480 perhaps you've seen a demonstration before, you're  probably familiar with our Zone detection slider.   168 00:16:21,280 --> 00:16:27,280 So this is a tool that will redefine or detect  the reading zones within a document. And if   169 00:16:27,280 --> 00:16:31,120 you find that you have a pretty consistent  formatting throughout the entire document,   170 00:16:31,120 --> 00:16:36,880 you can apply this to all pages. Now the same  thing can be done for the reading order. So   171 00:16:36,880 --> 00:16:41,840 if you have, for example, every page in  the document is a two-column reading order,   172 00:16:41,840 --> 00:16:45,840 you can choose two columns as that  reading order and hit “Apply to all pages”   173 00:16:45,840 --> 00:16:50,800 and then Equidox will programmatically set that  two column layouts to every one of those pages.   174 00:16:51,840 --> 00:16:55,680 We also have the ability to set up a heading  template. So again, assuming that the document   175 00:16:57,120 --> 00:17:03,440 is consistently formatted and designed throughout,  you can actually identify your heading hierarchy.   176 00:17:03,440 --> 00:17:09,760 So for example if you find that your heading level  twos are all in the same font style, the same font   177 00:17:09,760 --> 00:17:15,280 characteristics, you can hit a simple checkbox and  Equidox will programmatically look forward in the   178 00:17:15,280 --> 00:17:21,520 document and identify all of the different areas  of the document where you have that same font   179 00:17:21,520 --> 00:17:26,480 style and will automatically apply that heading  level two for you. And that can be done for all   180 00:17:26,480 --> 00:17:32,320 of your various heading levels. We also have  an “Ignore” feature. The Ignore feature is   181 00:17:32,320 --> 00:17:38,400 the ability to bulk artifact repetitive  elements in a document. So if you have   182 00:17:39,120 --> 00:17:44,560 a footer or a header (maybe it's like a revision  date or a serial number, something that is just   183 00:17:44,560 --> 00:17:51,280 continuously repeating on every single page) you  don't necessarily need to tag that on every page.   184 00:17:51,280 --> 00:17:54,640 And you can use the ignore feature,  if it's located in the same location,   185 00:17:55,200 --> 00:18:00,080 to bulk artifacts all of that information  throughout the entire document. So that you're   186 00:18:00,080 --> 00:18:05,200 not feeding this redundant information to the  screen reader user page after page after page.   187 00:18:07,440 --> 00:18:13,040 Now another important part of this is to keep  a rhythm. So, I find when I’m working on longer   188 00:18:13,040 --> 00:18:20,160 documents particularly, that remediation can be  a very rhythmic process. So if you have similar   189 00:18:20,160 --> 00:18:25,200 layouts and designs throughout the document,  a lot of those pages will require the same   190 00:18:25,200 --> 00:18:31,520 techniques. And once you get through a handful of  pages, you will start to eliminate redundancies   191 00:18:31,520 --> 00:18:37,200 and consolidate steps. So that you can start  just eliminating, literally eliminating clicks.   192 00:18:37,200 --> 00:18:42,000 And if you're if you're doing a high volume  type of project where you have a lot of pages to   193 00:18:42,000 --> 00:18:47,360 get through, if you can eliminate a handful  of clicks or a handful of steps on every page,   194 00:18:47,360 --> 00:18:51,760 that really adds up to a lot of time savings  at the end of that document, or the end of that   195 00:18:51,760 --> 00:18:59,200 project. So, as you get into a rhythm, I find it's  very helpful to try to stay within that rhythm.   196 00:18:59,840 --> 00:19:04,080 And then the outlier type of pages,  perhaps you come across a page   197 00:19:04,080 --> 00:19:10,160 where you have a completely different layout, or  there's a large infographic or a flow chart or   198 00:19:10,160 --> 00:19:13,360 something that is completely different than the  pages that you've been previously working, on   199 00:19:14,400 --> 00:19:18,640 you can always skip over that page and just  return to it at the very end of the process.   200 00:19:19,200 --> 00:19:24,320 So rather than disrupting that flow that you've  just developed for the previous 10 or 20 pages,   201 00:19:24,320 --> 00:19:28,960 you can just skip over it and go back  to that flow chart or back to that   202 00:19:28,960 --> 00:19:34,560 infographic that requires a lot of extra work.  And then you can kind of finish off the document   203 00:19:34,560 --> 00:19:39,120 at the very end by addressing that outlier  type of page at the very end of the workflow.   204 00:19:41,040 --> 00:19:46,400 Now another part of Equidox is validating as you  go. And this is something that we will also cover   205 00:19:46,400 --> 00:19:52,160 in the in the demonstration when we get  to it. So the HTML preview, if you've   206 00:19:52,160 --> 00:19:58,400 used Equidox before I’m sure you're familiar  with this, but the HTML preview is a way of   207 00:19:58,400 --> 00:20:03,360 checking the structure that you've built on the  page in terms of reading order, your headings,   208 00:20:03,360 --> 00:20:08,960 lists, tables. All of that information that  you have structured in that remediation process   209 00:20:08,960 --> 00:20:15,920 will be previewable in the HTML preview page. And  this is a way of just checking your work before   210 00:20:15,920 --> 00:20:20,880 moving on to a subsequent page. So, you'll get  a lot of feedback from the application as you're   211 00:20:20,880 --> 00:20:26,160 going through at the page level. And if you're  struggling with a certain page, you can always   212 00:20:26,160 --> 00:20:30,960 revisit it and finish it off at the very end. So,  it's just a way of checking your work as you go.   213 00:20:31,920 --> 00:20:36,480 Also when you are finished working through  a document, when you export the PDF,   214 00:20:38,000 --> 00:20:44,000 Equidox will run a series of accessibility  checks on that export. Now the bullet points   215 00:20:44,000 --> 00:20:48,160 here on the slide… this is not a comprehensive  list of everything that it is checking for,   216 00:20:48,880 --> 00:20:53,680 but just a few examples of the things that it's  looking for are missing alt text for images…   217 00:20:53,680 --> 00:20:59,440 if you have illogical reading orders or logical  heading structures… invalid merges… for example,   218 00:20:59,440 --> 00:21:05,280 if you are trying to merge a list with a  text zone, for example… And then the errors   219 00:21:05,840 --> 00:21:11,280 that Equidox finds throughout this accessibility  check, they are directly linked to the specific   220 00:21:11,280 --> 00:21:15,360 page where it was found on. And they give you  a clickable link where you can just select the   221 00:21:15,360 --> 00:21:19,840 link and go straight to the page where that error  was found. And then you can make your correction   222 00:21:19,840 --> 00:21:24,800 in Equidox and then just re-export the document  once you've worked through all of those errors.   223 00:21:25,680 --> 00:21:29,840 So this accessibility check  that we are applying on export,   224 00:21:30,960 --> 00:21:36,480 as I said it's not a fully comprehensive  accessibility report that you are getting…   225 00:21:37,440 --> 00:21:41,520 We still highly recommend using  third-party validation tools   226 00:21:41,520 --> 00:21:47,440 and especially screen readers. Screen readers  to replicate exactly what a screen reader user   227 00:21:47,440 --> 00:21:52,080 is experiencing when they're working with this  document. Now automated accessibility checking…   228 00:21:52,720 --> 00:21:57,840 it can be automated to a certain degree but there  are many, many different elements that it can miss   229 00:21:58,640 --> 00:22:03,920 because it's simply looking for technical  requirements. But it's not actually able to   230 00:22:04,640 --> 00:22:08,960 replicate what an end user is going through with a  screen reader. So it's very important to make sure   231 00:22:08,960 --> 00:22:15,120 that you are that you are checking your work as  you as you export the document. Putting it through   232 00:22:15,120 --> 00:22:22,880 various third-party tools, or using screen readers  which is the best and the ultimate check for the   233 00:22:22,880 --> 00:22:29,760 actual usability of that document. Okay, so let's  jump into Equidox now, and we'll work through a   234 00:22:29,760 --> 00:22:34,800 couple of examples here… and I realize that we’ve  got about 10 minutes left. So I’m going to go   235 00:22:34,800 --> 00:22:39,120 pretty quickly through these documents. But that's  sort of the point with Equidox, is to be able to   236 00:22:39,120 --> 00:22:44,560 move very efficiently through them. So I have a  couple of documents pre-loaded here into Equidox   237 00:22:44,560 --> 00:22:49,760 and I want to start with this one here. And this  is just a one-page document. So as I said before,   238 00:22:49,760 --> 00:22:53,840 when you have a one-page document, sometimes it's  best to just jump right in and see what you have   239 00:22:54,480 --> 00:22:59,440 rather than trying to spend a few minutes  to be very strategic and tactical with it.   240 00:22:59,440 --> 00:23:04,640 You can just approach this page and just see what  you've got in here, and start working through it.   241 00:23:04,640 --> 00:23:09,360 Now what I notice right away is I have some  images, I have some headings, I have text,   242 00:23:09,360 --> 00:23:15,120 I have a list here, ( in fact, a nested list), and  a table down here at the very bottom. So just to   243 00:23:15,120 --> 00:23:20,160 quickly work through this, what I’m going to do is  I’m going to start by, at the top and just start   244 00:23:20,160 --> 00:23:24,560 artifacting images that I don't really need.  I don't know what this is, just a background   245 00:23:24,560 --> 00:23:32,800 sort of watermark type of image… Here's another  image of black and white dog holding a briefcase…   246 00:23:34,320 --> 00:23:39,600 here's our Equidox logo up here. So I’m  just typing in all descriptions for them.   247 00:23:39,600 --> 00:23:43,760 Now I’m going to work on my heading structures. So  I hit… everything right now is currently set as a   248 00:23:43,760 --> 00:23:48,800 text zone because this document was untagged  to begin with. So if I just start hitting my   249 00:23:48,800 --> 00:23:55,120 appropriate keyboard shortcuts to set my headings.  Heading level 1, heading level 2, heading level 2,   250 00:23:55,840 --> 00:24:02,080 and heading level 2. Now I’ve defined my headings  for this page. Next up is this list. As you can   251 00:24:02,080 --> 00:24:07,920 tell this is a nested list, and what I’ll do is  I will hit “L” on my keyboard and I will use my   252 00:24:07,920 --> 00:24:13,280 List Detection slider. So if I move this over,  Equidox is going to look inside of this list and   253 00:24:13,280 --> 00:24:18,400 it's going to pick up these nested layers inside  of the individual list items. So if you can tell,   254 00:24:18,400 --> 00:24:23,040 or if you've used Equidox before, you'll notice  that this list has been automatically detected,   255 00:24:23,040 --> 00:24:26,880 and the nestings have been detected  inside of those individual list items,   256 00:24:26,880 --> 00:24:31,680 simply by moving this list detection slider. So  this is using artificial intelligence and machine   257 00:24:31,680 --> 00:24:36,400 learning to better understand list structures,  and something like this only takes a few seconds.   258 00:24:37,280 --> 00:24:41,600 Now this down here is a table, and if you can  tell I have a bunch of different text zones   259 00:24:41,600 --> 00:24:47,440 covering up this table because it has no existing  tag structure. So I’m not going to worry too much   260 00:24:47,440 --> 00:24:52,320 about those existing zones. I’m just going to draw  a single zone over the entire table, hit “T” on my   261 00:24:52,320 --> 00:24:58,640 keyboard and open up the Table Editor. Once I  open up the Table Editor, I’m going to then use   262 00:24:58,640 --> 00:25:04,880 my detection tools here. So again, the computer  vision and machine learning to redefine the table   263 00:25:04,880 --> 00:25:10,400 structure. So the individual rows and columns  and cells have been all identified through our   264 00:25:10,400 --> 00:25:16,560 artificial intelligence. And then I can actually  use the spanning feature to span across here for   265 00:25:16,560 --> 00:25:21,440 these various years. I’m just holding Shift and  pressing “S” on my keyboard. And then this is an   266 00:25:21,440 --> 00:25:26,960 example of a table where I actually have two rows  of column headers so 2017 and 18. And then the   267 00:25:26,960 --> 00:25:33,200 various Q1s through Q4s… these are also going to  be read as column headers. So, I will just change   268 00:25:33,200 --> 00:25:39,440 my column header from the standard one to two. And  then if I look at my HTML preview of this table,   269 00:25:40,080 --> 00:25:45,520 what I see is a nice clean accessible HTML table.  And then the great part of Equidox is that it will   270 00:25:45,520 --> 00:25:52,000 take this table and it will automatically convert  it into a PDF tag tree. Now when I save this table   271 00:25:52,000 --> 00:25:57,360 and close out, all of those individual zones that  were previously there have been artifacted. So I’m   272 00:25:57,360 --> 00:26:04,000 just left with the single Table zone. So, the last  thing I need to do now is just reorder the page.   273 00:26:04,000 --> 00:26:08,400 Because I’ve removed some zones and I’ve drawn  some zones manually, my reading order is a little   274 00:26:08,400 --> 00:26:12,960 bit out of sync. So, all I’m going to do is go  to the Page Tab and press “Reorder.” And I can   275 00:26:12,960 --> 00:26:17,680 always check my HTML preview just to validate  that I’m satisfied with everything that I see.   276 00:26:17,680 --> 00:26:21,200 This all looks pretty good to me, so what  I’ll do is I will just save the page.   277 00:26:22,160 --> 00:26:27,040 I can mark it as validated if I'd like to, just  to remind myself that I’ve already finished this>   278 00:26:27,040 --> 00:26:33,360 And if I go to the Output Tab, I’m then able to  generate the PDF. Now when I generate the PDF,   279 00:26:33,360 --> 00:26:38,800 hopefully I’m not going to get any errors, and  in fact I did not. But now the PDF opens up in   280 00:26:38,800 --> 00:26:43,280 a separate tab for me in my browser. If you can  tell, nothing is visually changed about this   281 00:26:43,280 --> 00:26:48,080 document. It's exactly the same, but all of the  tag structure that I’ve just set up during that   282 00:26:48,080 --> 00:26:55,600 remediation process is going to be automatically  converted into those PDF tags for me. Now with   283 00:26:55,600 --> 00:27:00,960 just a few minutes left, I want to take a look at  a bit of a longer document. And this is a document   284 00:27:00,960 --> 00:27:08,320 here where it's quite a bit older. It's been it  was created in 2007, and actually this document   285 00:27:08,320 --> 00:27:14,240 has an OCR page. So this kind of incorporates a  few different elements into it one place. I like   286 00:27:14,240 --> 00:27:18,640 to start, when working with longer documents… Of  course I’ll take a look at the thumbnails for each   287 00:27:18,640 --> 00:27:24,160 of the pages and just try to get an understanding  for what is this document actually made up of.   288 00:27:24,800 --> 00:27:30,080 But then one thing I like to look at is the Images  Tab. Now the Images Tab, I can see here there are   289 00:27:30,080 --> 00:27:34,800 quite a few images in this document. And they are  somewhat technical, because these are screenshots   290 00:27:34,800 --> 00:27:40,640 and sort of instructional images. And some of the  images already have alt text, whereas others do   291 00:27:40,640 --> 00:27:44,640 not. So this is something to pay attention to  as you're going through page by page, making   292 00:27:44,640 --> 00:27:50,560 sure that your images have all been described or  in some cases artifacted if you do not feel that   293 00:27:50,560 --> 00:27:55,840 they need to be given a description. So this is  something where I’m not really able to tell, based   294 00:27:55,840 --> 00:28:00,480 on the context of the document, what this image is  trying to represent. So I don't feel comfortable   295 00:28:00,480 --> 00:28:05,760 providing alt text quite yet. I’ll wait till I get  through the document and go through page by page.   296 00:28:07,360 --> 00:28:12,880 Since I have OCR here on this very first page,  I’m just going to use my Zone Detector. This   297 00:28:12,880 --> 00:28:19,680 Zone Detection tool is something that we have not  yet demoed, but it's definitely a huge part of the   298 00:28:19,680 --> 00:28:25,120 Equidox system is using that Zone Detection  tool. And you can redefine the granularity of   299 00:28:25,120 --> 00:28:28,960 the reading zones. I have this background image,  so I’m going to just get rid of that background   300 00:28:28,960 --> 00:28:33,360 image, and I’m going to OCR all zones. So, I’m  going to convert this scanned page. If you can   301 00:28:33,360 --> 00:28:38,400 tell, this is actually scanned I’m actually going  to convert it now into selectable searchable   302 00:28:38,400 --> 00:28:43,520 usable text. And once I’ve done that, I can then  set my heading level. So this is a heading level   303 00:28:43,520 --> 00:28:48,720 one, because it's on the very first page, nice big  bold font. And you can check your HTML preview.   304 00:28:49,760 --> 00:28:56,240 So I’m pretty satisfied with that page I can then  move on to the next. Now if we had the if we had   305 00:28:56,240 --> 00:29:01,040 more time I would basically just be working  through this document very quickly. A page   306 00:29:01,040 --> 00:29:06,400 like this, where I had existing tag structure of  one big P tag covering the entire page… this can   307 00:29:06,400 --> 00:29:11,360 be very quickly adjusted, setting my headings  and setting my list down here. So if I just   308 00:29:11,360 --> 00:29:16,240 move my List Detector from left to right that will  define this list with all the individual elements.   309 00:29:17,040 --> 00:29:23,920 I can check my HTML preview… perhaps I want to  get rid of these headers and footers, for example,   310 00:29:23,920 --> 00:29:28,880 like for the revision date… This is the type  of information that can just be very redundant…   311 00:29:29,440 --> 00:29:32,880 So, using the Ignore feature as  I mentioned before, if I just   312 00:29:32,880 --> 00:29:37,680 draw a zone that's intersecting with that zone,  I press “I” on my keyboard, that will become an   313 00:29:37,680 --> 00:29:43,760 Ignore zone. And then if I go back to the preview,  the revision date is no longer part of the tagged   314 00:29:45,040 --> 00:29:49,200 elements. So if I go forward into the  document and I look at other pages,   315 00:29:50,160 --> 00:29:57,360 the Ignore zone is already waiting for me. So it's  already there, ready to remove that revision date.   316 00:29:58,480 --> 00:30:05,600 Now this is a 14- page document, and working  through it I would spend probably about 30 seconds   317 00:30:05,600 --> 00:30:10,720 per page on average to get through this. So it  would be about a five- to ten-minute document   318 00:30:10,720 --> 00:30:16,880 to get through using some of the steps that  I’ve already explained and described. You know,   319 00:30:16,880 --> 00:30:22,800 the Ignore feature, setting up a heading template,  using the auto detection feature of the zones,   320 00:30:23,360 --> 00:30:28,480 you can very quickly get through content  like this. Because we've taken the time   321 00:30:28,480 --> 00:30:32,400 to be a little bit more strategic in how we  approach it. So you don't have to remember   322 00:30:32,400 --> 00:30:35,760 to remove this on every page, you don't  have to set your headings on every page,   323 00:30:35,760 --> 00:30:40,720 because you can do it programmatically through  the heading template. But this is what a sort of a   324 00:30:41,760 --> 00:30:46,160 more refined workflow can look like when  you're working through longer documents.   325 00:30:46,160 --> 00:30:50,640 And even longer than 14 pages. Many times, you  might have documents that are hundreds of pages,   326 00:30:50,640 --> 00:30:56,000 and so the time savings can really be  multiplied over those various pages.   327 00:30:56,960 --> 00:31:01,680 And then when you export a document like this,  (keeping in mind that I have not addressed every   328 00:31:01,680 --> 00:31:07,440 page, this is going to actually produce some error  messages for me) so this is where I just wanted to   329 00:31:07,440 --> 00:31:12,160 show you that, for example, those missing images  and as well as some other elements that I have   330 00:31:12,160 --> 00:31:18,000 not yet touched, these are going to populate in  this in this error list for me. And if I were   331 00:31:18,000 --> 00:31:22,560 actually remediating this document, I would be  able to go to the list of errors or warnings,   332 00:31:22,560 --> 00:31:27,040 and I'd be able to click on these warning messages  and go directly to the page where they're located,   333 00:31:27,600 --> 00:31:32,720 fix them and then regenerate the PDF. Now I  realize that we are just about out of time,   334 00:31:32,720 --> 00:31:36,480 and I want to be respectful of everyone else's  time, so I’m actually going to just jump back   335 00:31:36,480 --> 00:31:41,840 to the slide deck here. And I will just say  thank you again for everyone for joining today.   336 00:31:42,480 --> 00:31:48,160 As I said before if you have any specific  questions or feedback, we would love to chat   337 00:31:48,160 --> 00:31:55,040 with you and talk about how Equidox can fit into  your PDF remediation workflow. We can do another   338 00:31:55,040 --> 00:32:00,720 more specific demonstration, working through some  of your documents if you'd like, and talk about   339 00:32:00,720 --> 00:32:07,440 how to define and refine a workflow for yourself.  So please feel free to visit us at www.Equidox.co   340 00:32:07,440 --> 00:32:13,840 or email EquidoxSales@Onixnet.com. So thank  you everyone and we will see you next month!  341 00:32:13,840 --> 00:32:19,600 [Tammy Albee] I will just interject that we've had  a large number of questions during this webinar. I   342 00:32:19,600 --> 00:32:26,240 believe I have answered all of them, but if anyone  had more questions or needs a personalized demo to   343 00:32:26,240 --> 00:32:33,280 get more questions answered and see the software  in action, maybe in a little more detail, we are   344 00:32:33,280 --> 00:32:39,360 more than happy to schedule that. Or you can email  us at EquidoxSales@Onix.net.com, as you see on the   345 00:32:39,360 --> 00:32:44,880 screen, and we'll be happy to answer any of those  additional questions you might have. Thanks Dan!  346 00:32:44,880 --> 00:32:55,840 [Dan Tuleta] Thanks everyone!  Have a great rest of your day!