1 00:00:00,250 --> 00:00:09,930 [Dan Tuleta] All right, well welcome everyone to another edition of Equidox Webinar Wednesdays. 2 00:00:09,930 --> 00:00:13,530 It's just about two o'clock, so I think we should get started. 3 00:00:13,530 --> 00:00:18,100 Now for anyone that has joined us before, we really appreciate you continuing to attend 4 00:00:18,100 --> 00:00:23,599 these webinars to learn a bit more about document remediation and Equidox. 5 00:00:23,599 --> 00:00:30,430 So today's presentation is going to be talking about refining and designing a workflow for 6 00:00:30,430 --> 00:00:33,930 attacking these documents from a remediation standpoint. 7 00:00:33,930 --> 00:00:43,320 So as always please feel free to reach out to us at any given time through Our website 8 00:00:43,320 --> 00:00:47,110 is www.Equidox.co. 9 00:00:47,110 --> 00:00:52,390 So we do encourage follow-up questions or feedback for these webinars and if you'd like 10 00:00:52,390 --> 00:00:57,230 to see any personalized demonstrations of Equidox, maybe more tailored to your specific 11 00:00:57,230 --> 00:01:00,930 workflow, please feel free to reach out to us. 12 00:01:00,930 --> 00:01:05,460 We would love to start a conversation with you. 13 00:01:05,460 --> 00:01:12,330 Now in terms of document remediation, if you are assigned with remediating documents… 14 00:01:12,330 --> 00:01:17,460 if this is your everyday job… or if this is just a one-off project, maybe that that 15 00:01:17,460 --> 00:01:23,979 you've been asked to complete as a content creator, there's a few things to consider 16 00:01:23,979 --> 00:01:25,380 in the pre-flight process. 17 00:01:25,380 --> 00:01:30,830 So before you get started with remediating a document, one of the first things that I 18 00:01:30,830 --> 00:01:36,570 like to look at is how many pages are there, If there are just one or a couple of pages 19 00:01:36,570 --> 00:01:42,760 in this document, you know, then then you can kind of just power through it and just 20 00:01:42,760 --> 00:01:47,000 kind of approach it by just opening the document up and taking a look at what you have and 21 00:01:47,000 --> 00:01:48,210 start working on it. 22 00:01:48,210 --> 00:01:55,479 Now if there are a lot of pages in this document, typically I like to take a few minutes to 23 00:01:55,479 --> 00:01:58,149 just investigate what the document is compiled of. 24 00:01:58,149 --> 00:02:03,070 Because you might choose to be a little bit more strategic and tactical in how you approach 25 00:02:03,070 --> 00:02:04,549 that document. 26 00:02:04,549 --> 00:02:08,091 Because there could be a number of things that you could consolidate… 27 00:02:08,091 --> 00:02:13,720 steps along the way that will eliminate redundancies and, you know, forcing you to repeat steps 28 00:02:13,720 --> 00:02:15,480 page after page. 29 00:02:15,480 --> 00:02:20,410 So we'll take a look at a few of those things later on during the demonstration. 30 00:02:20,410 --> 00:02:24,330 Now another thing to consider is how complex is the content? 31 00:02:24,330 --> 00:02:31,270 Now a document might have a lot of pages, but if you consider a document maybe that's 32 00:02:31,270 --> 00:02:36,360 just all text, for example, where you just have a series of paragraphs page after page 33 00:02:36,360 --> 00:02:37,630 after page… 34 00:02:37,630 --> 00:02:41,840 The great thing about Equidox is its ability to auto-detect that type of content. 35 00:02:41,840 --> 00:02:45,220 And it's going to do a lot of the heavy lifting for you. 36 00:02:45,220 --> 00:02:50,900 So the complexity of the content does have a lot to do with your workflow as you remediate. 37 00:02:50,900 --> 00:02:57,581 Now a 100-page document made up entirely of text elements could be easier than a 10-page 38 00:02:57,581 --> 00:02:59,970 document although it's much longer. 39 00:02:59,970 --> 00:03:06,659 A 10-page document might have very technical engineering drawings, or very complex elements 40 00:03:06,659 --> 00:03:11,160 that take more actual work to tag properly. 41 00:03:11,160 --> 00:03:14,420 So something to look at is the complexity of the content. 42 00:03:14,420 --> 00:03:18,810 To kind of build out the workflow in your head before you get started. 43 00:03:18,810 --> 00:03:23,560 Another thing that I look at is, is the design and the formatting of the document consistent 44 00:03:23,560 --> 00:03:24,560 throughout? 45 00:03:24,560 --> 00:03:27,690 So is this document coming from a template? 46 00:03:27,690 --> 00:03:32,230 Is there a consistent use of heading structure throughout? 47 00:03:32,230 --> 00:03:35,599 Are the lists properly formatted throughout the document? 48 00:03:35,599 --> 00:03:38,840 How consistent is this page after page? 49 00:03:38,840 --> 00:03:42,260 Sometimes I see documents which are almost like a Frankenstein document. 50 00:03:42,260 --> 00:03:49,400 Where it's a series of pages that have come together from various locations. 51 00:03:49,400 --> 00:03:51,049 There's nothing consistent about them. 52 00:03:51,049 --> 00:03:56,940 Every page is unique, and every page has to be treated like as its own individual puzzle. 53 00:03:56,940 --> 00:04:02,800 So it's just important to note if there's consistent formatting throughout the document. 54 00:04:02,800 --> 00:04:08,579 There are ways within Equidox to consolidate steps and to eliminate some redundancies so 55 00:04:08,579 --> 00:04:12,409 that you don't have to repeat the same exact steps page after page. 56 00:04:12,409 --> 00:04:18,549 Now another thing that I look at always, is if there is existing tag structure associated 57 00:04:18,549 --> 00:04:20,320 with the document. 58 00:04:20,320 --> 00:04:23,850 And existing tag structure could be good or bad. 59 00:04:23,850 --> 00:04:28,880 Now many documents, depending on where they come from… for example, Microsoft Word being 60 00:04:28,880 --> 00:04:34,380 a very common one, when you hit “Save as PDF” coming out of Microsoft Word oftentimes 61 00:04:34,380 --> 00:04:40,400 there is an automated tagging process where Microsoft Word will attempt to convert that 62 00:04:40,400 --> 00:04:47,699 word file into a PDF and generate some automated tags throughout that process. 63 00:04:47,699 --> 00:04:52,860 Now that can be good or bad, depending on the way that the document was put together. 64 00:04:52,860 --> 00:04:58,550 But the point is that you don't really know what you're going to get during that conversion 65 00:04:58,550 --> 00:05:01,930 process from Microsoft Word into PDF. 66 00:05:01,930 --> 00:05:07,720 Because how those how those tags are generated is basically a mystery to the user. 67 00:05:07,720 --> 00:05:10,270 You just have to kind of wait and see what you get. 68 00:05:10,270 --> 00:05:13,610 There are of course many things that you can take into consideration when you're designing 69 00:05:13,610 --> 00:05:20,491 your documents to improve the accessibility of them when you do convert them into a PDF. 70 00:05:20,491 --> 00:05:25,730 But they still need to be looked at because PDF and Word are different formats and they 71 00:05:25,730 --> 00:05:30,520 just have different ways of marking up documents from an accessibility standpoint. 72 00:05:30,520 --> 00:05:34,919 So just because your Microsoft Word document is technically accessible, it does not mean 73 00:05:34,919 --> 00:05:38,500 that the PDF that that generates is going to be automatically accessible. 74 00:05:38,500 --> 00:05:44,050 So you still need to take into consideration that the existing tags that were generated 75 00:05:44,050 --> 00:05:49,660 automatically through Microsoft Word, they might still need to be touched up or adjusted. 76 00:05:49,660 --> 00:05:56,850 Now there are other documents that a that may have been remediated at some point in 77 00:05:56,850 --> 00:06:03,380 their life cycle where someone might have opened the document in Adobe Acrobat and pressed 78 00:06:03,380 --> 00:06:08,610 auto tag and just kind of crossed their fingers and hoped that they got some sort of usable 79 00:06:08,610 --> 00:06:10,520 tag structure. 80 00:06:10,520 --> 00:06:16,199 This can be something that is going to cause you more problems than be of service. 81 00:06:16,199 --> 00:06:17,729 So these are just things to look at. 82 00:06:17,729 --> 00:06:20,389 If you have existing tag structure. 83 00:06:20,389 --> 00:06:25,380 What is the quality of those existing tags, and are they worth maintaining, or is it better 84 00:06:25,380 --> 00:06:28,800 to just use Equidox and start from scratch? 85 00:06:28,800 --> 00:06:34,420 Because of the automation and all of the tools that Equidox gives you to tag documents, there 86 00:06:34,420 --> 00:06:41,099 are many instances where it can be easier to just start over rather than trying to rescue 87 00:06:41,099 --> 00:06:45,480 poor existing tags. 88 00:06:45,480 --> 00:06:49,509 Another feature that or another aspect that I like to look at is does the document require 89 00:06:49,509 --> 00:06:50,509 OCR? 90 00:06:50,509 --> 00:06:53,630 And OCR stands for Optical Character Recognition. 91 00:06:53,630 --> 00:06:59,289 This can be relevant for documents that are scanned, for example, or documents that might 92 00:06:59,289 --> 00:07:05,240 contain infographics that have text that is actually locked inside of images. 93 00:07:05,240 --> 00:07:11,770 Now OCR certainly adds another layer of complexity to remediation because in order to make these 94 00:07:11,770 --> 00:07:17,750 documents compatible with screen readers and machine-readable, an image is simply an image. 95 00:07:17,750 --> 00:07:21,940 We need to extract text from it if there is in fact text inside of it. 96 00:07:21,940 --> 00:07:28,050 So if you have a document where someone took an old newspaper and ran it through a scanner, 97 00:07:28,050 --> 00:07:31,940 technically that document can come out of that scanner as a PDF. 98 00:07:31,940 --> 00:07:36,250 But it is simply just an image as far as a machine reader is concerned. 99 00:07:36,250 --> 00:07:41,199 So that this requires an extra step called Optical Character Recognition, where you're 100 00:07:41,199 --> 00:07:47,940 actually analyzing the image and extracting the usable searchable, selectable text from 101 00:07:47,940 --> 00:07:48,940 it. 102 00:07:48,940 --> 00:07:54,389 So you have to run this OCR process to extract that text, which can take it can take a little 103 00:07:54,389 --> 00:07:58,720 bit of extra time, but this is just sort of the nature of the of the format. 104 00:07:58,720 --> 00:08:03,610 If you're dealing with scanned images then you have to take the time to make sure that 105 00:08:03,610 --> 00:08:08,690 you are extracting the content from it in order to make it machine readable. 106 00:08:08,690 --> 00:08:11,680 Another thing to consider is are there form fields present? 107 00:08:11,680 --> 00:08:17,500 So documents PDF documents that are intended to be fillable forms oftentimes will have 108 00:08:17,500 --> 00:08:19,280 form fields present within them. 109 00:08:19,280 --> 00:08:26,270 And this is another element that does take a little bit of extra care because form fields 110 00:08:26,270 --> 00:08:28,819 need to have what are called Tooltips applied to them. 111 00:08:28,819 --> 00:08:32,190 So if you find that there are form fields in the document, that's another thing to be 112 00:08:32,190 --> 00:08:36,690 aware of and another thing to sort of budget your time and your workflow around… making 113 00:08:36,690 --> 00:08:42,800 sure that you that you take the time to properly handle those form fields. 114 00:08:42,800 --> 00:08:46,700 And then one of the last things that I like to look at are the images. 115 00:08:46,700 --> 00:08:51,399 So if there are images present in the document, which many times there will be, how many images 116 00:08:51,399 --> 00:08:52,420 are there? 117 00:08:52,420 --> 00:08:54,260 About how many images per page are there? 118 00:08:54,260 --> 00:09:00,279 I mean, are these images informative are they technical or are they something that are just 119 00:09:00,279 --> 00:09:01,279 decorative? 120 00:09:01,279 --> 00:09:04,650 Now decorative images could be things like repetitive logos… 121 00:09:04,650 --> 00:09:09,050 they could be like watermark… or background images… 122 00:09:09,050 --> 00:09:11,160 they could be stock images. 123 00:09:11,160 --> 00:09:19,779 So some sort of brochure where you just have two people bending over a computer smiling 124 00:09:19,779 --> 00:09:21,660 at the computer screen… 125 00:09:21,660 --> 00:09:26,410 These are sorts of like stock images that are not actually adding any content a contextual 126 00:09:26,410 --> 00:09:29,050 value to the document itself. 127 00:09:29,050 --> 00:09:33,269 They're simply there for the aesthetics and to take up some space and maybe balance out 128 00:09:33,269 --> 00:09:34,610 the page. 129 00:09:34,610 --> 00:09:40,880 So not all images require alt text, but that's up to the remediator to determine… 130 00:09:40,880 --> 00:09:45,300 If there are images, what kind of alt text and what kind of effort do you have to make 131 00:09:45,300 --> 00:09:47,530 to apply that alt text? 132 00:09:47,530 --> 00:09:51,410 Perhaps the image… the document is full of technical images. 133 00:09:51,410 --> 00:09:56,640 Maybe this is like engineering content where you have a bunch of complex diagrams and graphs 134 00:09:56,640 --> 00:09:57,790 and charts. 135 00:09:57,790 --> 00:10:01,410 If you're just the remediator and you're not the subject matter expert, you might need 136 00:10:01,410 --> 00:10:06,220 to incorporate someone who knows more about that subject than you to help you with the 137 00:10:06,220 --> 00:10:07,270 alt text writing. 138 00:10:07,270 --> 00:10:13,000 So these are just things to consider as you are going through your remediation and as 139 00:10:13,000 --> 00:10:18,380 you're developing your workflow throughout that document. 140 00:10:18,380 --> 00:10:21,140 Now some collaboration ideas! 141 00:10:21,140 --> 00:10:26,269 Because Equidox is a web-based application, there's a collaboration aspect built into 142 00:10:26,269 --> 00:10:27,790 the tool. 143 00:10:27,790 --> 00:10:33,980 So we can actually work together and share documents through the Equidox platform. 144 00:10:33,980 --> 00:10:39,040 And because we work on a concurrent user licensing model, that means you can have multiple people 145 00:10:39,040 --> 00:10:43,769 logged into the account, into the same account simultaneously, and you can even have multiple 146 00:10:43,769 --> 00:10:48,540 people working together on the same document simultaneously. 147 00:10:48,540 --> 00:10:53,910 So Equidox gives you the ability to share documents with other users or groups and you 148 00:10:53,910 --> 00:10:55,589 can actually divide and conquer. 149 00:10:55,589 --> 00:11:00,269 So let's say you have a 200-page document and it's got a very tight deadline. 150 00:11:00,269 --> 00:11:03,209 You know, you'd like to get it posted by the end of the day. 151 00:11:03,209 --> 00:11:08,320 Well rather than one person having to hunch over their computer all day and kind of panic 152 00:11:08,320 --> 00:11:12,950 about this very large dense document, you might have five different users and everyone 153 00:11:12,950 --> 00:11:14,480 takes 40 pages. 154 00:11:14,480 --> 00:11:18,540 And then you can kind of power through that document in an hour or two, rather than having 155 00:11:18,540 --> 00:11:22,750 one person have to stress out and clear their calendar to make sure that they can get through 156 00:11:22,750 --> 00:11:23,750 it. 157 00:11:23,750 --> 00:11:26,829 So this is all made possible because this is a cloud-based application. 158 00:11:26,829 --> 00:11:32,250 Being able to share the document and even have multiple users working together simultaneously 159 00:11:32,250 --> 00:11:36,220 has a lot of advantages built into it. 160 00:11:36,220 --> 00:11:40,850 As I mentioned before, images could require subject matter experts so depending on your 161 00:11:40,850 --> 00:11:47,950 background or the nature of the document these images that are contained in the document 162 00:11:47,950 --> 00:11:53,190 might be so complex that you're not exactly sure how to provide an accurate alt description. 163 00:11:53,190 --> 00:11:58,630 So that again, the advantage of being web-based, is you could send the URL for the page over 164 00:11:58,630 --> 00:12:02,802 through an instant messenger or through an email to one of your colleagues, or one of 165 00:12:02,802 --> 00:12:07,590 your teammates who might be more familiar with the content, and just ask them to take 166 00:12:07,590 --> 00:12:11,089 a few minutes to provide alt text for those images. 167 00:12:11,089 --> 00:12:13,660 And then you can do the rest of the tagging of the documents. 168 00:12:13,660 --> 00:12:17,350 So from a structural standpoint, you can set your reading order, and your headings, and 169 00:12:17,350 --> 00:12:22,079 your lists and your tables, but those images that require that alternate text you might 170 00:12:22,079 --> 00:12:24,300 not be comfortable trying to provide that… 171 00:12:24,300 --> 00:12:27,100 If they are very technical or very complex images… 172 00:12:27,100 --> 00:12:30,170 so you can get someone else involved in that workflow. 173 00:12:30,170 --> 00:12:33,480 Because this is cloud-based, they can be working on that alt text while you are working on 174 00:12:33,480 --> 00:12:36,370 the rest of the document. 175 00:12:36,370 --> 00:12:42,380 And also in the Equidox application, there is a validation tool, and a Page Notes feature, 176 00:12:42,380 --> 00:12:47,000 so assuming that you're working together with a large team on a specific document, you can 177 00:12:47,000 --> 00:12:51,480 use the validation tool to essentially mark pages as complete. 178 00:12:51,480 --> 00:12:53,570 So that you can avoid redundancies… 179 00:12:53,570 --> 00:12:57,750 so someone isn't coming into a page and making adjustments and changing things that have 180 00:12:57,750 --> 00:13:00,880 already been marked as validated. 181 00:13:00,880 --> 00:13:06,690 And then also the Page Notes feature is a way of just leaving yourself or other users 182 00:13:06,690 --> 00:13:07,690 a note. 183 00:13:07,690 --> 00:13:13,449 Perhaps you skipped over an element because you weren't sure how to approach it, or you'd 184 00:13:13,449 --> 00:13:18,820 like to just have some sort of communication with your colleagues as to what happened on 185 00:13:18,820 --> 00:13:22,080 that specific page, or how you decided to approach it. 186 00:13:22,080 --> 00:13:28,360 Where they might be able to sort of replicate that process on other similar pages. 187 00:13:28,360 --> 00:13:32,501 And Equidox will give you a consolidated list of all of these various page notes that you 188 00:13:32,501 --> 00:13:34,850 take throughout the throughout the document. 189 00:13:34,850 --> 00:13:40,060 And whether you're an Administrator or a power user, and you have less skilled less trained 190 00:13:40,060 --> 00:13:44,410 workers working underneath you, you can very quickly look through the page notes and make 191 00:13:44,410 --> 00:13:46,260 sure that everything has been addressed. 192 00:13:46,260 --> 00:13:50,329 If there were questions, or if there were issues, you can always just follow up in the 193 00:13:50,329 --> 00:13:55,650 in the application and look at that specific page and make sure that that issue was addressed. 194 00:13:55,650 --> 00:13:59,710 And we'll go through that as well in the demonstration. 195 00:13:59,710 --> 00:14:01,459 Now eliminating repetition. 196 00:14:01,459 --> 00:14:08,220 So, if we have a lot of different tools built into Equidox that allow you to programmatically 197 00:14:08,220 --> 00:14:10,259 apply things to all pages. 198 00:14:10,259 --> 00:14:15,089 So, if you are an existing Equidox user or perhaps you've seen a demonstration before, 199 00:14:15,089 --> 00:14:18,750 you're probably familiar with our Zone detection slider. 200 00:14:18,750 --> 00:14:24,190 So this is a tool that will redefine or detect the reading zones within a document. 201 00:14:24,190 --> 00:14:28,470 And if you find that you have a pretty consistent formatting throughout the entire document, 202 00:14:28,470 --> 00:14:31,209 you can apply this to all pages. 203 00:14:31,209 --> 00:14:33,649 Now the same thing can be done for the reading order. 204 00:14:33,649 --> 00:14:39,220 So if you have, for example, every page in the document is a two-column reading order, 205 00:14:39,220 --> 00:14:43,380 you can choose two columns as that reading order and hit “Apply to all pages” and 206 00:14:43,380 --> 00:14:49,290 then Equidox will programmatically set that two column layouts to every one of those pages. 207 00:14:49,290 --> 00:14:51,550 We also have the ability to set up a heading template. 208 00:14:51,550 --> 00:14:58,110 So again, assuming that the document is consistently formatted and designed throughout, you can 209 00:14:58,110 --> 00:15:00,899 actually identify your heading hierarchy. 210 00:15:00,899 --> 00:15:06,500 So for example if you find that your heading level twos are all in the same font style, 211 00:15:06,500 --> 00:15:11,670 the same font characteristics, you can hit a simple checkbox and Equidox will programmatically 212 00:15:11,670 --> 00:15:17,089 look forward in the document and identify all of the different areas of the document 213 00:15:17,089 --> 00:15:21,461 where you have that same font style and will automatically apply that heading level two 214 00:15:21,461 --> 00:15:22,461 for you. 215 00:15:22,461 --> 00:15:26,080 And that can be done for all of your various heading levels. 216 00:15:26,080 --> 00:15:28,150 We also have an “Ignore” feature. 217 00:15:28,150 --> 00:15:34,600 The Ignore feature is the ability to bulk artifact repetitive elements in a document. 218 00:15:34,600 --> 00:15:41,230 So if you have a footer or a header (maybe it's like a revision date or a serial number, 219 00:15:41,230 --> 00:15:45,910 something that is just continuously repeating on every single page) you don't necessarily 220 00:15:45,910 --> 00:15:48,510 need to tag that on every page. 221 00:15:48,510 --> 00:15:54,160 And you can use the ignore feature, if it's located in the same location, to bulk artifacts 222 00:15:54,160 --> 00:15:57,029 all of that information throughout the entire document. 223 00:15:57,029 --> 00:16:01,500 So that you're not feeding this redundant information to the screen reader user page 224 00:16:01,500 --> 00:16:04,850 after page after page. 225 00:16:04,850 --> 00:16:07,720 Now another important part of this is to keep a rhythm. 226 00:16:07,720 --> 00:16:13,460 So, I find when I’m working on longer documents particularly, that remediation can be a very 227 00:16:13,460 --> 00:16:15,459 rhythmic process. 228 00:16:15,459 --> 00:16:20,860 So if you have similar layouts and designs throughout the document, a lot of those pages 229 00:16:20,860 --> 00:16:23,720 will require the same techniques. 230 00:16:23,720 --> 00:16:29,220 And once you get through a handful of pages, you will start to eliminate redundancies and 231 00:16:29,220 --> 00:16:31,030 consolidate steps. 232 00:16:31,030 --> 00:16:34,709 So that you can start just eliminating, literally eliminating clicks. 233 00:16:34,709 --> 00:16:39,050 And if you're if you're doing a high volume type of project where you have a lot of pages 234 00:16:39,050 --> 00:16:44,800 to get through, if you can eliminate a handful of clicks or a handful of steps on every page, 235 00:16:44,800 --> 00:16:48,829 that really adds up to a lot of time savings at the end of that document, or the end of 236 00:16:48,829 --> 00:16:49,829 that project. 237 00:16:49,829 --> 00:16:57,300 So, as you get into a rhythm, I find it's very helpful to try to stay within that rhythm. 238 00:16:57,300 --> 00:17:03,540 And then the outlier type of pages, perhaps you come across a page where you have a completely 239 00:17:03,540 --> 00:17:08,339 different layout, or there's a large infographic or a flow chart or something that is completely 240 00:17:08,339 --> 00:17:12,459 different than the pages that you've been previously working, on you can always skip 241 00:17:12,459 --> 00:17:16,590 over that page and just return to it at the very end of the process. 242 00:17:16,590 --> 00:17:21,730 So rather than disrupting that flow that you've just developed for the previous 10 or 20 pages, 243 00:17:21,730 --> 00:17:27,110 you can just skip over it and go back to that flow chart or back to that infographic that 244 00:17:27,110 --> 00:17:29,530 requires a lot of extra work. 245 00:17:29,530 --> 00:17:33,890 And then you can kind of finish off the document at the very end by addressing that outlier 246 00:17:33,890 --> 00:17:38,400 type of page at the very end of the workflow. 247 00:17:38,400 --> 00:17:41,720 Now another part of Equidox is validating as you go. 248 00:17:41,720 --> 00:17:46,179 And this is something that we will also cover in the in the demonstration when we get to 249 00:17:46,179 --> 00:17:47,179 it. 250 00:17:47,179 --> 00:17:52,140 So the HTML preview, if you've used Equidox before I’m sure you're familiar with this, 251 00:17:52,140 --> 00:17:58,299 but the HTML preview is a way of checking the structure that you've built on the page 252 00:17:58,299 --> 00:18:02,270 in terms of reading order, your headings, lists, tables. 253 00:18:02,270 --> 00:18:07,740 All of that information that you have structured in that remediation process will be previewable 254 00:18:07,740 --> 00:18:09,930 in the HTML preview page. 255 00:18:09,930 --> 00:18:15,309 And this is a way of just checking your work before moving on to a subsequent page. 256 00:18:15,309 --> 00:18:19,340 So, you'll get a lot of feedback from the application as you're going through at the 257 00:18:19,340 --> 00:18:20,580 page level. 258 00:18:20,580 --> 00:18:25,150 And if you're struggling with a certain page, you can always revisit it and finish it off 259 00:18:25,150 --> 00:18:26,150 at the very end. 260 00:18:26,150 --> 00:18:29,440 So, it's just a way of checking your work as you go. 261 00:18:29,440 --> 00:18:35,860 Also when you are finished working through a document, when you export the PDF, Equidox 262 00:18:35,860 --> 00:18:40,370 will run a series of accessibility checks on that export. 263 00:18:40,370 --> 00:18:42,360 Now the bullet points here on the slide… 264 00:18:42,360 --> 00:18:47,280 this is not a comprehensive list of everything that it is checking for, but just a few examples 265 00:18:47,280 --> 00:18:52,049 of the things that it's looking for are missing alt text for images… if you have illogical 266 00:18:52,049 --> 00:18:55,030 reading orders or logical heading structures… 267 00:18:55,030 --> 00:18:59,850 invalid merges… for example, if you are trying to merge a list with a text zone, for 268 00:18:59,850 --> 00:19:01,549 example… 269 00:19:01,549 --> 00:19:07,220 And then the errors that Equidox finds throughout this accessibility check, they are directly 270 00:19:07,220 --> 00:19:10,220 linked to the specific page where it was found on. 271 00:19:10,220 --> 00:19:13,570 And they give you a clickable link where you can just select the link and go straight to 272 00:19:13,570 --> 00:19:15,720 the page where that error was found. 273 00:19:15,720 --> 00:19:20,340 And then you can make your correction in Equidox and then just re-export the document once 274 00:19:20,340 --> 00:19:23,090 you've worked through all of those errors. 275 00:19:23,090 --> 00:19:31,440 So this accessibility check that we are applying on export, as I said it's not a fully comprehensive 276 00:19:31,440 --> 00:19:34,620 accessibility report that you are getting… 277 00:19:34,620 --> 00:19:40,770 We still highly recommend using third-party validation tools and especially screen readers. 278 00:19:40,770 --> 00:19:45,850 Screen readers to replicate exactly what a screen reader user is experiencing when they're 279 00:19:45,850 --> 00:19:47,600 working with this document. 280 00:19:47,600 --> 00:19:49,650 Now automated accessibility checking… 281 00:19:49,650 --> 00:19:54,570 it can be automated to a certain degree but there are many, many different elements that 282 00:19:54,570 --> 00:19:59,929 it can miss because it's simply looking for technical requirements. 283 00:19:59,929 --> 00:20:04,890 But it's not actually able to replicate what an end user is going through with a screen 284 00:20:04,890 --> 00:20:05,890 reader. 285 00:20:05,890 --> 00:20:10,020 So it's very important to make sure that you are that you are checking your work as you 286 00:20:10,020 --> 00:20:11,980 as you export the document. 287 00:20:11,980 --> 00:20:17,110 Putting it through various third-party tools, or using screen readers which is the best 288 00:20:17,110 --> 00:20:22,420 and the ultimate check for the actual usability of that document. 289 00:20:22,420 --> 00:20:28,600 Okay, so let's jump into Equidox now, and we'll work through a couple of examples here… 290 00:20:28,600 --> 00:20:31,450 and I realize that we’ve got about 10 minutes left. 291 00:20:31,450 --> 00:20:33,659 So I’m going to go pretty quickly through these documents. 292 00:20:33,659 --> 00:20:37,570 But that's sort of the point with Equidox, is to be able to move very efficiently through 293 00:20:37,570 --> 00:20:38,570 them. 294 00:20:38,570 --> 00:20:42,669 So I have a couple of documents pre-loaded here into Equidox and I want to start with 295 00:20:42,669 --> 00:20:43,710 this one here. 296 00:20:43,710 --> 00:20:46,210 And this is just a one-page document. 297 00:20:46,210 --> 00:20:49,740 So as I said before, when you have a one-page document, sometimes it's best to just jump 298 00:20:49,740 --> 00:20:55,530 right in and see what you have rather than trying to spend a few minutes to be very strategic 299 00:20:55,530 --> 00:20:56,840 and tactical with it. 300 00:20:56,840 --> 00:21:01,110 You can just approach this page and just see what you've got in here, and start working 301 00:21:01,110 --> 00:21:02,110 through it. 302 00:21:02,110 --> 00:21:06,640 Now what I notice right away is I have some images, I have some headings, I have text, 303 00:21:06,640 --> 00:21:11,720 I have a list here, ( in fact, a nested list), and a table down here at the very bottom. 304 00:21:11,720 --> 00:21:16,110 So just to quickly work through this, what I’m going to do is I’m going to start 305 00:21:16,110 --> 00:21:20,140 by, at the top and just start artifacting images that I don't really need. 306 00:21:20,140 --> 00:21:24,120 I don't know what this is, just a background sort of watermark type of image… 307 00:21:24,120 --> 00:21:31,880 Here's another image of black and white dog holding a briefcase… 308 00:21:31,880 --> 00:21:33,799 here's our Equidox logo up here. 309 00:21:33,799 --> 00:21:37,000 So I’m just typing in all descriptions for them. 310 00:21:37,000 --> 00:21:38,730 Now I’m going to work on my heading structures. 311 00:21:38,730 --> 00:21:39,730 So I hit… 312 00:21:39,730 --> 00:21:43,750 everything right now is currently set as a text zone because this document was untagged 313 00:21:43,750 --> 00:21:44,830 to begin with. 314 00:21:44,830 --> 00:21:49,410 So if I just start hitting my appropriate keyboard shortcuts to set my headings. 315 00:21:49,410 --> 00:21:54,510 Heading level 1, heading level 2, heading level 2, and heading level 2. 316 00:21:54,510 --> 00:21:57,580 Now I’ve defined my headings for this page. 317 00:21:57,580 --> 00:21:59,110 Next up is this list. 318 00:21:59,110 --> 00:22:04,059 As you can tell this is a nested list, and what I’ll do is I will hit “L” on my 319 00:22:04,059 --> 00:22:06,580 keyboard and I will use my List Detection slider. 320 00:22:06,580 --> 00:22:10,980 So if I move this over, Equidox is going to look inside of this list and it's going to 321 00:22:10,980 --> 00:22:14,950 pick up these nested layers inside of the individual list items. 322 00:22:14,950 --> 00:22:18,929 So if you can tell, or if you've used Equidox before, you'll notice that this list has been 323 00:22:18,929 --> 00:22:23,510 automatically detected, and the nestings have been detected inside of those individual list 324 00:22:23,510 --> 00:22:26,380 items, simply by moving this list detection slider. 325 00:22:26,380 --> 00:22:30,960 So this is using artificial intelligence and machine learning to better understand list 326 00:22:30,960 --> 00:22:34,710 structures, and something like this only takes a few seconds. 327 00:22:34,710 --> 00:22:38,590 Now this down here is a table, and if you can tell I have a bunch of different text 328 00:22:38,590 --> 00:22:43,020 zones covering up this table because it has no existing tag structure. 329 00:22:43,020 --> 00:22:46,049 So I’m not going to worry too much about those existing zones. 330 00:22:46,049 --> 00:22:50,159 I’m just going to draw a single zone over the entire table, hit “T” on my keyboard 331 00:22:50,159 --> 00:22:52,919 and open up the Table Editor. 332 00:22:52,919 --> 00:22:57,640 Once I open up the Table Editor, I’m going to then use my detection tools here. 333 00:22:57,640 --> 00:23:03,140 So again, the computer vision and machine learning to redefine the table structure. 334 00:23:03,140 --> 00:23:08,250 So the individual rows and columns and cells have been all identified through our artificial 335 00:23:08,250 --> 00:23:09,280 intelligence. 336 00:23:09,280 --> 00:23:14,980 And then I can actually use the spanning feature to span across here for these various years. 337 00:23:14,980 --> 00:23:18,010 I’m just holding Shift and pressing “S” on my keyboard. 338 00:23:18,010 --> 00:23:21,919 And then this is an example of a table where I actually have two rows of column headers 339 00:23:21,919 --> 00:23:24,100 so 2017 and 18. 340 00:23:24,100 --> 00:23:29,400 And then the various Q1s through Q4s… these are also going to be read as column headers. 341 00:23:29,400 --> 00:23:34,340 So, I will just change my column header from the standard one to two. 342 00:23:34,340 --> 00:23:39,700 And then if I look at my HTML preview of this table, what I see is a nice clean accessible 343 00:23:39,700 --> 00:23:41,090 HTML table. 344 00:23:41,090 --> 00:23:44,669 And then the great part of Equidox is that it will take this table and it will automatically 345 00:23:44,669 --> 00:23:48,160 convert it into a PDF tag tree. 346 00:23:48,160 --> 00:23:52,610 Now when I save this table and close out, all of those individual zones that were previously 347 00:23:52,610 --> 00:23:54,440 there have been artifacted. 348 00:23:54,440 --> 00:23:57,600 So I’m just left with the single Table zone. 349 00:23:57,600 --> 00:24:01,480 So, the last thing I need to do now is just reorder the page. 350 00:24:01,480 --> 00:24:05,261 Because I’ve removed some zones and I’ve drawn some zones manually, my reading order 351 00:24:05,261 --> 00:24:06,500 is a little bit out of sync. 352 00:24:06,500 --> 00:24:10,020 So, all I’m going to do is go to the Page Tab and press “Reorder.” 353 00:24:10,020 --> 00:24:14,080 And I can always check my HTML preview just to validate that I’m satisfied with everything 354 00:24:14,080 --> 00:24:15,080 that I see. 355 00:24:15,080 --> 00:24:19,450 This all looks pretty good to me, so what I’ll do is I will just save the page. 356 00:24:19,450 --> 00:24:23,270 I can mark it as validated if I'd like to, just to remind myself that I’ve already 357 00:24:23,270 --> 00:24:28,820 finished this> And if I go to the Output Tab, I’m then able to generate the PDF. 358 00:24:28,820 --> 00:24:32,890 Now when I generate the PDF, hopefully I’m not going to get any errors, and in fact I 359 00:24:32,890 --> 00:24:33,950 did not. 360 00:24:33,950 --> 00:24:38,330 But now the PDF opens up in a separate tab for me in my browser. 361 00:24:38,330 --> 00:24:41,110 If you can tell, nothing is visually changed about this document. 362 00:24:41,110 --> 00:24:45,380 It's exactly the same, but all of the tag structure that I’ve just set up during that 363 00:24:45,380 --> 00:24:52,580 remediation process is going to be automatically converted into those PDF tags for me. 364 00:24:52,580 --> 00:24:57,200 Now with just a few minutes left, I want to take a look at a bit of a longer document. 365 00:24:57,200 --> 00:25:01,289 And this is a document here where it's quite a bit older. 366 00:25:01,289 --> 00:25:07,130 It's been it was created in 2007, and actually this document has an OCR page. 367 00:25:07,130 --> 00:25:11,330 So this kind of incorporates a few different elements into it one place. 368 00:25:11,330 --> 00:25:13,960 I like to start, when working with longer documents… 369 00:25:13,960 --> 00:25:17,530 Of course I’ll take a look at the thumbnails for each of the pages and just try to get 370 00:25:17,530 --> 00:25:21,620 an understanding for what is this document actually made up of. 371 00:25:21,620 --> 00:25:24,700 But then one thing I like to look at is the Images Tab. 372 00:25:24,700 --> 00:25:29,360 Now the Images Tab, I can see here there are quite a few images in this document. 373 00:25:29,360 --> 00:25:33,570 And they are somewhat technical, because these are screenshots and sort of instructional 374 00:25:33,570 --> 00:25:34,570 images. 375 00:25:34,570 --> 00:25:38,309 And some of the images already have alt text, whereas others do not. 376 00:25:38,309 --> 00:25:41,840 So this is something to pay attention to as you're going through page by page, making 377 00:25:41,840 --> 00:25:47,380 sure that your images have all been described or in some cases artifacted if you do not 378 00:25:47,380 --> 00:25:50,860 feel that they need to be given a description. 379 00:25:50,860 --> 00:25:54,230 So this is something where I’m not really able to tell, based on the context of the 380 00:25:54,230 --> 00:25:56,669 document, what this image is trying to represent. 381 00:25:56,669 --> 00:25:59,710 So I don't feel comfortable providing alt text quite yet. 382 00:25:59,710 --> 00:26:03,870 I’ll wait till I get through the document and go through page by page. 383 00:26:03,870 --> 00:26:10,159 Since I have OCR here on this very first page, I’m just going to use my Zone Detector. 384 00:26:10,159 --> 00:26:14,690 This Zone Detection tool is something that we have not yet demoed, but it's definitely 385 00:26:14,690 --> 00:26:20,510 a huge part of the Equidox system is using that Zone Detection tool. 386 00:26:20,510 --> 00:26:23,590 And you can redefine the granularity of the reading zones. 387 00:26:23,590 --> 00:26:27,059 I have this background image, so I’m going to just get rid of that background image, 388 00:26:27,059 --> 00:26:28,090 and I’m going to OCR all zones. 389 00:26:28,090 --> 00:26:30,450 So, I’m going to convert this scanned page. 390 00:26:30,450 --> 00:26:34,299 If you can tell, this is actually scanned I’m actually going to convert it now into 391 00:26:34,299 --> 00:26:37,340 selectable searchable usable text. 392 00:26:37,340 --> 00:26:39,950 And once I’ve done that, I can then set my heading level. 393 00:26:39,950 --> 00:26:44,250 So this is a heading level one, because it's on the very first page, nice big bold font. 394 00:26:44,250 --> 00:26:46,110 And you can check your HTML preview. 395 00:26:46,110 --> 00:26:51,190 So I’m pretty satisfied with that page I can then move on to the next. 396 00:26:51,190 --> 00:26:56,380 Now if we had the if we had more time I would basically just be working through this document 397 00:26:56,380 --> 00:26:57,420 very quickly. 398 00:26:57,420 --> 00:27:03,240 A page like this, where I had existing tag structure of one big P tag covering the entire 399 00:27:03,240 --> 00:27:07,690 page… this can be very quickly adjusted, setting my headings and setting my list down 400 00:27:07,690 --> 00:27:08,690 here. 401 00:27:08,690 --> 00:27:12,330 So if I just move my List Detector from left to right that will define this list with all 402 00:27:12,330 --> 00:27:14,390 the individual elements. 403 00:27:14,390 --> 00:27:20,740 I can check my HTML preview… perhaps I want to get rid of these headers and footers, for 404 00:27:20,740 --> 00:27:23,460 example, like for the revision date… 405 00:27:23,460 --> 00:27:26,370 This is the type of information that can just be very redundant… 406 00:27:26,370 --> 00:27:31,670 So, using the Ignore feature as I mentioned before, if I just draw a zone that's intersecting 407 00:27:31,670 --> 00:27:35,799 with that zone, I press “I” on my keyboard, that will become an Ignore zone. 408 00:27:35,799 --> 00:27:41,190 And then if I go back to the preview, the revision date is no longer part of the tagged 409 00:27:41,190 --> 00:27:43,299 elements. 410 00:27:43,299 --> 00:27:49,480 So if I go forward into the document and I look at other pages, the Ignore zone is already 411 00:27:49,480 --> 00:27:51,260 waiting for me. 412 00:27:51,260 --> 00:27:55,830 So it's already there, ready to remove that revision date. 413 00:27:55,830 --> 00:28:02,640 Now this is a 14- page document, and working through it I would spend probably about 30 414 00:28:02,640 --> 00:28:05,340 seconds per page on average to get through this. 415 00:28:05,340 --> 00:28:10,820 So it would be about a five- to ten-minute document to get through using some of the 416 00:28:10,820 --> 00:28:14,030 steps that I’ve already explained and described. 417 00:28:14,030 --> 00:28:18,430 You know, the Ignore feature, setting up a heading template, using the auto detection 418 00:28:18,430 --> 00:28:24,950 feature of the zones, you can very quickly get through content like this. 419 00:28:24,950 --> 00:28:28,520 Because we've taken the time to be a little bit more strategic in how we approach it. 420 00:28:28,520 --> 00:28:32,010 So you don't have to remember to remove this on every page, you don't have to set your 421 00:28:32,010 --> 00:28:35,980 headings on every page, because you can do it programmatically through the heading template. 422 00:28:35,980 --> 00:28:41,780 But this is what a sort of a more refined workflow can look like when you're working 423 00:28:41,780 --> 00:28:43,559 through longer documents. 424 00:28:43,559 --> 00:28:45,270 And even longer than 14 pages. 425 00:28:45,270 --> 00:28:49,460 Many times, you might have documents that are hundreds of pages, and so the time savings 426 00:28:49,460 --> 00:28:54,409 can really be multiplied over those various pages. 427 00:28:54,409 --> 00:28:58,610 And then when you export a document like this, (keeping in mind that I have not addressed 428 00:28:58,610 --> 00:29:04,070 every page, this is going to actually produce some error messages for me) so this is where 429 00:29:04,070 --> 00:29:08,600 I just wanted to show you that, for example, those missing images and as well as some other 430 00:29:08,600 --> 00:29:13,120 elements that I have not yet touched, these are going to populate in this in this error 431 00:29:13,120 --> 00:29:14,470 list for me. 432 00:29:14,470 --> 00:29:18,890 And if I were actually remediating this document, I would be able to go to the list of errors 433 00:29:18,890 --> 00:29:23,600 or warnings, and I'd be able to click on these warning messages and go directly to the page 434 00:29:23,600 --> 00:29:27,390 where they're located, fix them and then regenerate the PDF. 435 00:29:27,390 --> 00:29:31,230 Now I realize that we are just about out of time, and I want to be respectful of everyone 436 00:29:31,230 --> 00:29:35,419 else's time, so I’m actually going to just jump back to the slide deck here. 437 00:29:35,419 --> 00:29:39,880 And I will just say thank you again for everyone for joining today. 438 00:29:39,880 --> 00:29:45,350 As I said before if you have any specific questions or feedback, we would love to chat 439 00:29:45,350 --> 00:29:50,750 with you and talk about how Equidox can fit into your PDF remediation workflow. 440 00:29:50,750 --> 00:29:56,480 We can do another more specific demonstration, working through some of your documents if 441 00:29:56,480 --> 00:30:01,850 you'd like, and talk about how to define and refine a workflow for yourself. 442 00:30:01,850 --> 00:30:04,880 For more information about how Equidox Software Company can help you 443 00:30:04,880 --> 00:30:10,799 with PDF accessibility Email us at EquidoxSales@equidox.co 444 00:30:10,799 --> 00:30:14,540 Or give us a call at 216-529-3030 445 00:30:14,540 --> 00:30:19,450 Or visit our website at www.equidox.co