1 00:00:03,840 --> 00:00:09,280 All right so it's just about two o'clock. So  welcome in everyone. Thank you for attending   2 00:00:09,280 --> 00:00:15,200 another edition of Equidox Webinar Wednesdays.  This month we are going to be talking about the   3 00:00:15,200 --> 00:00:22,320 Truth About Auto-Tagging. So a lot of people that  we speak to seem to have a pretty heavy reliance   4 00:00:22,320 --> 00:00:29,440 on Adobe Acrobat's auto-tagging feature. Which  is not really intended to make documents fully   5 00:00:29,440 --> 00:00:33,840 accessible. It's really just designed to  get you started on the path of remediating   6 00:00:33,840 --> 00:00:38,880 that PDF. So we want to talk a little bit  about how that works, and its limitations,   7 00:00:39,760 --> 00:00:45,760 and different ways of going about using Equidox  to fix the issues that auto-tagging can create.   8 00:00:46,800 --> 00:00:50,800 So as always I'd just like to mention  that if you would like to contact us   9 00:00:51,440 --> 00:00:59,280 feel free to reach out to EquidoxSales@  Onixnet.com. Our website is www.Equidox.co. 10 00:01:00,160 --> 00:01:05,280 We're also very active on social media, so if  you would like to connect with us on LinkedIn   11 00:01:05,280 --> 00:01:09,920 or Twitter, or Facebook we do post a lot of  information and articles. So you can always   12 00:01:09,920 --> 00:01:14,640 stay up to date with what's happening with  the Equidox team. So with that said let's   13 00:01:14,640 --> 00:01:21,600 get started. I'm just going to quickly mention a  couple of things about our parent company. So Onix   14 00:01:22,240 --> 00:01:27,760 is mainly a cloud consultancy. And we're most  known for our long-standing partnership with   15 00:01:27,760 --> 00:01:33,760 Google. And Equidox is a software product that  is owned and developed by Onix, along with other   16 00:01:33,760 --> 00:01:39,280 professional services based mainly around PDF  accessibility, and our mission here with the   17 00:01:39,280 --> 00:01:44,480 Equidox team is to ensure that digital information  reaches everyone via accessibility solutions. 18 00:01:46,880 --> 00:01:55,760 So just a quick slide here about some clients of  ours. We do work with notable names of various   19 00:01:55,760 --> 00:02:01,040 sizes and across all major verticals whether  it be education, government, or enterprise.   20 00:02:01,760 --> 00:02:05,520 So we are here to help you regardless  of how big or small your company is.   21 00:02:06,240 --> 00:02:09,040 So please don't feel that you're  too big or too small to work with   22 00:02:09,040 --> 00:02:12,000 us. We are here to help you with all  of your digital accessibility needs. 23 00:02:13,760 --> 00:02:20,000 Okay, so what we're going to be doing today is  we’ve just put together a very stock sample PDF   24 00:02:20,000 --> 00:02:27,280 document. This is one quick... (Oh, and I should  mention before we go any further that there are a   25 00:02:27,280 --> 00:02:31,440 series of polls that we are going to be asking  during the webinar, so please feel free to   26 00:02:31,440 --> 00:02:37,680 answer those. Just if you have a moment just to  answer the little survey questions that pop up,   27 00:02:37,680 --> 00:02:40,880 that's greatly appreciated. That will  help us kind of tailor these webinars   28 00:02:41,520 --> 00:02:46,400 moving forward, to make sure that we're addressing  any questions or topics that you would like to see   29 00:02:46,400 --> 00:02:52,320 us cover.) But what we're going to do today is  take a look here at this sample PDF document.   30 00:02:52,960 --> 00:02:58,080 And so this document that we've put together,  it kind of just contains a bunch of the common   31 00:02:58,080 --> 00:03:03,520 elements that you would find in any given PDF.  So you'll see things like text, headings, lists,   32 00:03:03,520 --> 00:03:10,160 tables, images, OCR, and of course the complicated  task of setting the reading order for different   33 00:03:10,160 --> 00:03:15,520 pages. So we'll be kind of working through some  of these different elements and talking about   34 00:03:15,520 --> 00:03:21,600 ways that auto-tagging can go wrong, and  ways of fixing it after the fact in Equidox. 35 00:03:23,760 --> 00:03:29,120 Okay, so the first attempt that we made  at auto-tagging this simple document here,   36 00:03:29,120 --> 00:03:35,280 the simple one-page document… For whatever  reason, Adobe Acrobat decided to tag   37 00:03:35,280 --> 00:03:41,920 the entire document as one big image. So this  entire page, despite the fact that it has   38 00:03:41,920 --> 00:03:48,560 embedded text on it, and is full of all kinds of  different elements such as of course the text,   39 00:03:48,560 --> 00:03:52,560 the headings, the list, as I mentioned  before... Acrobat, for whatever reason,   40 00:03:52,560 --> 00:03:57,920 just decided that this whole thing is just going  to be one big image. So there's a lot of problems   41 00:03:57,920 --> 00:04:03,920 with tagging an entire document like this with  just something as simple as an image. Because   42 00:04:03,920 --> 00:04:11,840 the image itself is useless to a screen reader  user. There need to be tags around the content to   43 00:04:11,840 --> 00:04:15,920 give something for the screen reader to actually  interact with and navigate around the page.   44 00:04:16,640 --> 00:04:22,000 So there is no text-based content for the screen  reader to read from in this current state. So this   45 00:04:22,000 --> 00:04:27,840 of course would need to be fixed or changed or at  the very least another attempt at auto-tagging it.   46 00:04:29,120 --> 00:04:33,440 So the second try, when we tried to auto-tag  this again, and this kind of speaks to the   47 00:04:33,440 --> 00:04:40,240 inconsistency and the lack of reliability that  you'll get with auto-tagging in Acrobat… So the   48 00:04:40,240 --> 00:04:45,280 second attempt, we actually were able to  get some tags created around the content.   49 00:04:45,920 --> 00:04:49,200 So some of the elements were tagged.  Some of the elements were missed.   50 00:04:49,840 --> 00:04:54,240 But in the GIF, if you see on the left-hand  side here, we're kind of scrolling through   51 00:04:54,240 --> 00:05:00,480 the tag panel in Acrobat… And you can see that  it's pretty complicated-looking. It's messy. But   52 00:05:00,480 --> 00:05:06,160 this is what an auto-generated tag structure  can look like. So if you're a novice user,   53 00:05:06,160 --> 00:05:11,440 if you don't know exactly what you're doing here,  trying to understand the complexities of this tag   54 00:05:11,440 --> 00:05:15,680 structure on the left-hand side and how this  relates to making the document accessible...   55 00:05:17,520 --> 00:05:23,120 This can be a tall task. And especially if  you're trying to work through dozens of pages,   56 00:05:23,120 --> 00:05:27,040 dozens of documents, it can be a very  tedious and time-consuming process.   57 00:05:28,160 --> 00:05:33,360 So the tags that we're looking at here in  the GIF are entirely inaccurate and this   58 00:05:33,360 --> 00:05:38,080 document is really a long way away from being  accessible and usable to a screen reader user.   59 00:05:38,960 --> 00:05:43,200 And like I said before, the goal of  auto-tagging is really just to get you started.   60 00:05:43,760 --> 00:05:49,280 But it is inconsistent. It is unreliable. And  it definitely requires human intervention after   61 00:05:49,280 --> 00:05:53,760 the fact to fix the many elements that  will be missed or inaccurately tagged. 62 00:05:57,200 --> 00:06:03,760 So the next logical step, assuming that you're a  novice remediator, you've hit the auto-tag button   63 00:06:03,760 --> 00:06:09,200 and now you've got some tags… The next logical  step would be to run the accessibility checker.   64 00:06:09,760 --> 00:06:16,000 Now it's kind of funny because Acrobat is checking  its own work. Acrobat applied the auto-tagging,   65 00:06:16,000 --> 00:06:21,040 and now we're using Acrobat again to check  the accessibility of those tags. So Acrobat   66 00:06:21,040 --> 00:06:26,640 here is checking its own work. And even when it's  applying its own tag structures to this document,   67 00:06:27,200 --> 00:06:32,960 it's still locating multiple errors in the tag  structure. Keeping in mind that there are also   68 00:06:32,960 --> 00:06:38,880 many other errors in this tag structure that the  accessibility checker is incapable of identifying,   69 00:06:39,520 --> 00:06:44,640 there are many things that require human  intervention to identify and locate,   70 00:06:44,640 --> 00:06:49,760 and of course, go back and correct… So  if you are an experienced remediator   71 00:06:50,400 --> 00:06:55,840 you would be able to interact with this tag  structure and navigate through the tag panel.   72 00:06:55,840 --> 00:07:00,640 And you would find dozens of issues that  would need to be corrected. And the only   73 00:07:00,640 --> 00:07:04,560 way that they can be corrected is through manual  intervention. You cannot just continue to hit   74 00:07:04,560 --> 00:07:11,440 the auto-tag button and find ways of fixing  it. There is no magic solution. You simply   75 00:07:11,440 --> 00:07:15,840 have to work through the elements one by one and  correct all of the errors that have been created. 76 00:07:21,760 --> 00:07:30,640 Let's see here… There we go, so one of the first  errors that we're going to point out here in this   77 00:07:30,640 --> 00:07:37,760 auto-tag structure is the text error. So if you  notice at the top of the page, all of that content   78 00:07:37,760 --> 00:07:44,160 across the two columns where we have two separate  basic contact information or letterhead for the   79 00:07:44,160 --> 00:07:49,280 chief information officer and the chief executive  officer of this fake company that we've invented…   80 00:07:50,000 --> 00:07:56,400 It's going to group all of that text together  inside of one tag. And the screen reader would be   81 00:07:56,400 --> 00:08:02,080 instructed by the tag structure to read directly  across the two columns which are separating the   82 00:08:02,080 --> 00:08:07,680 two groups of contact information. So if you were  a screen reader user, it would be very confusing,   83 00:08:07,680 --> 00:08:12,800 because you would be hearing both names. You would  hear Joe Smith and Bob Jones read directly after   84 00:08:12,800 --> 00:08:17,360 each other. Then both addresses... then both phone  numbers… so on and so forth, all the way down   85 00:08:17,360 --> 00:08:22,400 these two separate columns. So it would render  those two bits of contact information pretty   86 00:08:22,400 --> 00:08:26,960 much useless to a screen reader user because  it would not be read in the correct order. And   87 00:08:26,960 --> 00:08:32,160 this is something that of course Acrobat just did  automatically. so it needs to be fixed manually. 88 00:08:35,760 --> 00:08:42,880 So another error that we are able to easily  locate here is a heading error. So within   89 00:08:42,880 --> 00:08:49,360 logical heading structure, the first heading in a  document needs to be set as a heading level one…   90 00:08:50,000 --> 00:08:56,160 And for some reason, Acrobat decided to tag this  very obvious heading level one... the top portion   91 00:08:56,160 --> 00:09:02,640 of the page as in as a heading level three. So  this is objectively incorrect. The first heading   92 00:09:02,640 --> 00:09:08,720 in a document needs to be a heading level one.  Adobe knows this, and still, their auto-tagger   93 00:09:09,280 --> 00:09:14,880 is kind of just you know grasping at straws as  it's applying tags to these different elements.   94 00:09:14,880 --> 00:09:21,600 So it assigns arbitrarily an H3 to this element,  despite the fact that it has to be a heading level   95 00:09:21,600 --> 00:09:28,720 one according to all accessibility rules. So the  reason it's setting it as an H3 instead of an H1,   96 00:09:29,600 --> 00:09:35,120 it's anyone's guess, but it's probably due to  the font size or the font style that was used.   97 00:09:35,120 --> 00:09:39,440 But whatever the case may be, it does need to be  corrected. And it can only be corrected manually.   98 00:09:41,440 --> 00:09:48,800 Okay, so the next major error that we're  going to see is another heading error. But   99 00:09:48,800 --> 00:09:54,560 this is a nesting error. So the second heading  on this document has been identified as an H2.   100 00:09:55,280 --> 00:10:00,640 Now you cannot have an H2 directly following  an H3 so this needs to be fixed as well. 101 00:10:04,160 --> 00:10:10,080 Another problem here is that we have this element  on the page which is actually a block of text, but   102 00:10:10,080 --> 00:10:15,120 it is sat inside of an image. So this is something  that can happen depending on the way that the   103 00:10:15,120 --> 00:10:21,040 document was designed. So if there is text locked  inside of an image that is just recognized as an   104 00:10:21,040 --> 00:10:26,480 image, as far as a screen reader is concerned. So  OCR is required (Optical character recognition)   105 00:10:26,480 --> 00:10:32,880 to actually convert that text into something that  is usable and can be read by a screen reader. Now   106 00:10:32,880 --> 00:10:39,360 Acrobat is able to locate this figure tag, however  the content inside of it it needs to be converted   107 00:10:39,360 --> 00:10:43,360 into something that is actually embedded text,  that can be read aloud by a screen reader.   108 00:10:44,080 --> 00:10:50,320 However, using OCR like on this one particular  page, when we're this far down into the actual   109 00:10:50,320 --> 00:10:56,000 remediation of it, can be a little bit risky.  There are strange occurrences of Acrobat trying to   110 00:10:56,000 --> 00:11:02,880 re-OCR the entire page or block the user from  continuing to tag the rest of the elements   111 00:11:02,880 --> 00:11:06,800 underneath it for the rest of the  document. So this is something that   112 00:11:06,800 --> 00:11:11,200 is it's kind of a challenge to deal with  in Acrobat, and it needs to be corrected,   113 00:11:11,200 --> 00:11:16,560 however. So definitely another little challenging  issue here with just the plain old auto-tagging.   114 00:11:18,000 --> 00:11:22,480 Now another obvious area, as we just continue to  work down the page, every single element you'll   115 00:11:22,480 --> 00:11:27,040 notice seems to have something wrong with it…  And this is just the nature of auto-tagging…   116 00:11:27,040 --> 00:11:32,880 So the next problem is the lists. So lists are  almost never tagged correctly from auto-tagging.   117 00:11:33,440 --> 00:11:38,880 And the entire list of these seven different  items, it's going to be read and grouped together   118 00:11:38,880 --> 00:11:43,600 as a paragraph. So all seven of those list  items are just going to be read as a strange   119 00:11:44,160 --> 00:11:51,120 run-on sentence. And doing this in Acrobat is  extremely tedious. It requires dozens of steps,   120 00:11:51,120 --> 00:11:56,320 and even for a small simple list like this,  there are so many clicks and steps associated   121 00:11:56,320 --> 00:12:01,200 with tagging it correctly that if you're just  a novice user and you don't know exactly how to   122 00:12:01,200 --> 00:12:04,960 navigate the complexities of this  Acrobat program in the tag panel,   123 00:12:05,600 --> 00:12:10,800 you can have a very very difficult time just  trying to tag this one very simple list. 124 00:12:14,000 --> 00:12:19,920 The next thing I want to point out is the table.  So tables are almost never correctly auto-tagged,   125 00:12:19,920 --> 00:12:26,320 and they also require manual fixes of course. So  the table cannot be left in this auto-tag state,   126 00:12:26,320 --> 00:12:30,320 because the screen reader is just going to be...  it's going to read straight across the cells   127 00:12:30,880 --> 00:12:35,840 without any indication of how the user is  currently oriented in terms of rows and   128 00:12:35,840 --> 00:12:40,640 columns and like where that screen reader  is actually focused on. So it renders the   129 00:12:40,640 --> 00:12:46,640 entire table useless because it simply cannot be  navigated as a table is intended to be navigated. 130 00:12:48,960 --> 00:12:52,240 So how do we go about fixing  this? So we've pointed out   131 00:12:52,240 --> 00:12:57,920 obviously at least half a dozen major flaws with  the auto-tagging of just this one simple page.   132 00:12:58,480 --> 00:13:05,040 So what we wanted to show you today is just  opening up Equidox, our PDF remediation software,   133 00:13:05,040 --> 00:13:12,240 and we are going to use the different features in  the platform to fix these types of issues. We're   134 00:13:12,240 --> 00:13:18,960 going to use things like our Zone Detection tools,  which is effectively its own type of auto-tagging,   135 00:13:18,960 --> 00:13:24,080 where we're able to group together and find  different clusters of content to get the user   136 00:13:24,080 --> 00:13:29,200 started. But of course, we have a much easier  interface and a much easier workflow to navigate   137 00:13:29,200 --> 00:13:35,520 through to actually make these corrections to the  document itself. So this list of bullet points is   138 00:13:35,520 --> 00:13:40,240 just a handful of things that we are going to  do as we go through the Equidox demonstration. 139 00:13:42,640 --> 00:13:47,040 I'm going to back out of the slide deck here,  and I'm actually going to open up Equidox.   140 00:13:47,040 --> 00:13:53,200 And I already have the document pre-loaded in  my Equidox account. So if you take a look here,   141 00:13:53,840 --> 00:13:59,280 this is what the document looks like when we  just simply import it into Equidox. so we have   142 00:14:00,080 --> 00:14:04,880 the tag structure that came from the original  this is Adobe's auto-tagging. If we remember,   143 00:14:04,880 --> 00:14:10,960 that issue before of the content being read  straight across the two separate columns up here…   144 00:14:11,600 --> 00:14:18,480 We have our heading level three, our heading level  two, our text zone capturing this entire list,   145 00:14:18,480 --> 00:14:22,880 this image of text is is going to be  a problem as well… And then down here,   146 00:14:22,880 --> 00:14:28,400 we have this table, which is going to be read  simply as text itself. So before we make any   147 00:14:28,400 --> 00:14:33,200 changes or alterations to this, if you've  never seen an Equidox demonstration before,   148 00:14:33,920 --> 00:14:37,440 if I press this button up here which  looks a bit like a computer monitor,   149 00:14:38,160 --> 00:14:43,840 this will actually open a separate tab in my  browser. As soon as my browser wakes up...   150 00:14:45,600 --> 00:14:51,600 Interesting... there we go! So it will open a  separate tab in my browser, and what I can see   151 00:14:51,600 --> 00:14:58,880 here is an HTML rendering of the page that I'm  currently working on. So the HTML is, basically,   152 00:14:59,600 --> 00:15:05,680 what it's doing, is it is showing me exactly how a  screen reader is going to interact with these tags   153 00:15:05,680 --> 00:15:12,320 as they currently sit. So if I take a look here,  if we remember the issue with the text jumping   154 00:15:12,320 --> 00:15:17,840 across the two columns, a screen reader would read  that as Joe Smith Bob Jones and then would read   155 00:15:17,840 --> 00:15:23,120 their titles… It would read their corporations  back to back, it would read their headquarters,   156 00:15:23,120 --> 00:15:27,520 and their different addresses in terms of  the street address, the city, and the state,   157 00:15:27,520 --> 00:15:32,560 their phone numbers… This is very convoluted  and confusing and not at all intended how it   158 00:15:32,560 --> 00:15:37,360 should be read by a screen reader. I'm working  down the page, you can see other issues here…   159 00:15:37,360 --> 00:15:42,160 We have our list down here at the very bottom,  which is going to be just read as a strange   160 00:15:42,160 --> 00:15:48,800 paragraph with little stars inserted inside of  it for list labels. And then this is actually   161 00:15:48,800 --> 00:15:52,720 what our table looks like currently. So  our table is also just going to be read   162 00:15:52,720 --> 00:15:57,760 as a strange run-on sentence that will make no  sense to a screen reader user… not even being   163 00:15:57,760 --> 00:16:02,960 aware that this is supposed to be a table until we  tag it as such. So what I'll do is, I'm going to   164 00:16:02,960 --> 00:16:10,320 come back into the actual Equidox Page Editor, and  I'm going to make some adjustments to this page.   165 00:16:10,320 --> 00:16:15,600 the first thing that I want to adjust is the  page. If I go to the Page Tab, what I'll do is I'm   166 00:16:15,600 --> 00:16:20,960 going to use the Zone Detector tool. So the Zone  Detector, what this is designed to do is ignore   167 00:16:20,960 --> 00:16:26,960 the existing tag structure, so that tag structure  that Acrobat gave us with the auto-tag feature   168 00:16:26,960 --> 00:16:31,680 that's not helping us at all... And I don't want  to try to work with those tags. I don't want to   169 00:16:31,680 --> 00:16:37,360 have to undo all of the mistakes that it made.  So we can let Equidox just redefine this page   170 00:16:37,360 --> 00:16:42,880 by using this Zone Detection slider. So if I move  this slider back and forth, left to right, you'll   171 00:16:42,880 --> 00:16:47,760 instantly see how we've cleaned things up pretty  significantly. We've gotten rid of all of those   172 00:16:47,760 --> 00:16:52,480 extra tags that were just really kind of getting  in the way. And now we're left with a much cleaner   173 00:16:52,480 --> 00:16:59,680 starting point that doesn't require as many manual  adjustments. So if I were to go into the HTML   174 00:16:59,680 --> 00:17:05,040 Preview again, already I've made some improvements  simply by the fact that I've grouped together Joe   175 00:17:05,040 --> 00:17:08,960 Smith and Bob Jones's contact information.  At least now it's going to be read properly.   176 00:17:09,760 --> 00:17:14,240 So I do have to make some additional adjustments,  of course. We have things like images, we have   177 00:17:14,240 --> 00:17:18,640 heading structure, we have to apply break lines  to these addresses, we have the list, and the   178 00:17:18,640 --> 00:17:24,160 table and we also have the OCR. So let's just  start working through the page, top to bottom.   179 00:17:25,040 --> 00:17:29,120 Up here in the top margin, we have just the  version number of this document. Things like   180 00:17:29,120 --> 00:17:34,160 this can often be artifacted because they're  not necessarily adding much value to the tag   181 00:17:34,160 --> 00:17:38,160 structure. So I can start by just hitting  backspace to get rid of that extra zone.   182 00:17:38,720 --> 00:17:43,360 I have this image here, which I'll just begin  with because it's the next tallest zone. But   183 00:17:43,360 --> 00:17:47,360 what I like...what I can do is actually apply  an alt description to it. So I can type in   184 00:17:47,360 --> 00:17:56,640 “letterhead mark maker sample company logo.” So  I'm able to apply an alt text to that image so   185 00:17:56,640 --> 00:18:01,760 that it will actually have some context for the  screen reader user as to what this image is. The   186 00:18:01,760 --> 00:18:07,680 contact information for Joe Smith and Bob Jones...  What I want this to do is, I want to apply a small   187 00:18:07,680 --> 00:18:12,800 pause at the end of each one of these lines so  that their contact information will be read in a   188 00:18:12,800 --> 00:18:17,280 bit smoother of a flow for the end-user, rather  than having it all just be treated as one big   189 00:18:17,280 --> 00:18:23,280 sentence. So if I apply the break lines checkbox  to these two different elements, that is going to   190 00:18:23,280 --> 00:18:28,800 improve the end-user’s experience as they read  through this page. The next thing I'm going to   191 00:18:28,800 --> 00:18:34,240 do is focus on my heading structure. So if I click  on this item here, where it says “Memorandum,” if   192 00:18:34,240 --> 00:18:40,960 we remember from the auto-tagging, this was set as  an H3. This is our first heading on the page so we   193 00:18:40,960 --> 00:18:46,880 really don't want this to be set as an H3. We want  this to be set as an H1 because it is of course   194 00:18:47,440 --> 00:18:52,080 a rule for our logical heading structure that  all documents have to have a heading level one.   195 00:18:52,080 --> 00:18:56,160 And that heading level one should be the first  heading on the page. So all I have to do to set   196 00:18:56,160 --> 00:19:01,360 this as a heading level one is simply tap “1” on  my keyboard. Now I've made some adjustments to   197 00:19:01,360 --> 00:19:05,760 the page so far, and just so that we don't get  too far ahead of ourselves if I press the HTML   198 00:19:05,760 --> 00:19:10,000 Preview again... I mean this is not something  that's required... you don't have to press the   199 00:19:10,000 --> 00:19:14,960 HTML Preview every time you make a change... It's  just simply something to show you kind of how   200 00:19:14,960 --> 00:19:19,200 things are evolving here in terms of how a screen  reader is going to interact with this document   201 00:19:19,760 --> 00:19:25,680 as we go. So if you can tell, now our actual  contact information for the Joe Smith and Bob   202 00:19:25,680 --> 00:19:29,360 Jones... they've actually been treated a little  bit differently. Where each one of those lines   203 00:19:29,360 --> 00:19:33,600 of text is going to be read with a slight pause  at the end of it, just making it a bit easier   204 00:19:33,600 --> 00:19:41,520 to understand. For example, where the phone number  starts versus where the actual area code ends. The   205 00:19:41,520 --> 00:19:47,520 memorandum is now a much larger font because I've  set this as an H1. So you can see that change   206 00:19:47,520 --> 00:19:52,640 reflected here in your HTML Preview, and then  everything else below we have not yet impacted.   207 00:19:52,640 --> 00:19:56,720 So we still need to make some changes down here.  But you can kind of see how things are changing as   208 00:19:56,720 --> 00:20:04,480 we go. So if I come back into the main page here,  I want to capture this “What is Lorem Ipsum” as an   209 00:20:04,480 --> 00:20:09,680 H2. So zones can be easily adjusted. If I  just resize that zone and drag it up, I can   210 00:20:09,680 --> 00:20:15,600 then grab a new zone, or grab this content by just  clicking and dragging to capture that text again.   211 00:20:15,600 --> 00:20:19,840 And then if I just simply hit “2” on my keyboard,  that will identify that as a heading level two.   212 00:20:21,440 --> 00:20:26,480 Down below here, here's that bit of OCR. So as  you can see, right now it's sat inside of an   213 00:20:26,480 --> 00:20:31,120 image zone. So what I can do is just get rid of  that image zone by just hitting backspace on my   214 00:20:31,120 --> 00:20:37,360 keyboard. And if I just draw a new zone on top of  that text, it's now a text zone. But I can change   215 00:20:37,360 --> 00:20:42,880 the zone source to OCR. OCR, of course, standing  for optical character recognition. If I hit the   216 00:20:42,880 --> 00:20:48,240 “Convert to Text” button here, this is going  to process this basically a screenshot of this   217 00:20:48,240 --> 00:20:53,040 area of the page, and it is actually going to turn  it into something here in this little text field   218 00:20:53,040 --> 00:20:58,080 where you have real encoded text that  can be actually read by a screen reader.   219 00:20:58,080 --> 00:21:03,120 The alternative to that would be literally typing  out every single word of this in the form of an   220 00:21:03,120 --> 00:21:07,760 alt text for that image. That's going to take  you a lot longer than simply just pressing the   221 00:21:07,760 --> 00:21:12,960 “Convert to Text” button in Equidox. Over  here we have another heading level two,   222 00:21:12,960 --> 00:21:18,400 so I'm just going to tap “2” on my keyboard. And  this is identifying where this list is located, so   223 00:21:18,400 --> 00:21:22,560 this list of course needs to be addressed right  now. Again it's just sat there as a text zone,   224 00:21:22,560 --> 00:21:26,240 so it's going to read this as just one big  run-on sentence, which is going to be strange   225 00:21:26,880 --> 00:21:31,680 and not make much sense to an end-user. So what  we can do to fix this if I just hit “L” on my   226 00:21:31,680 --> 00:21:37,360 keyboard that will change the zone to a list, and  then I can use the List Detection Slider. If I use   227 00:21:37,360 --> 00:21:42,400 the List Detection Slider, Equidox is able to  automatically identify where those list items   228 00:21:42,400 --> 00:21:48,640 are located. And it is basically doing all of the  heavy lifting for me. So it's going to change what   229 00:21:48,640 --> 00:21:54,640 was something that is simply a text zone, it's now  going to tag this as a list, and when I export the   230 00:21:54,640 --> 00:21:59,280 document, Equidox goes through the steps of  building out the complicated tag structure   231 00:21:59,280 --> 00:22:04,160 that is required for this list. So that I don't  have to do that manually. If you remember before,   232 00:22:04,160 --> 00:22:08,800 I was talking about how a simple little list like  this has only, I don't know, eight items in it,   233 00:22:08,800 --> 00:22:13,840 there are so many steps required to properly  tag this. It's very manual, it's very technical,   234 00:22:14,480 --> 00:22:19,600 and you have to really have a lot of patience and  just a deep understanding of what you're doing to   235 00:22:19,600 --> 00:22:23,840 be able to efficiently work through lists like  this. So with Equidox, it's simply a matter of   236 00:22:23,840 --> 00:22:28,400 identifying that this zone is a list, and moving  the slider from left to right, and you're done.   237 00:22:29,440 --> 00:22:33,120 The next thing we want to worry about  here is the table. So as you can see,   238 00:22:33,120 --> 00:22:38,560 after I've used that Zone Detector the table is  still not identified as a table. It's just kind of   239 00:22:38,560 --> 00:22:44,000 grouping the individual columns and rows together  in their own individual text zones. So I can fix   240 00:22:44,000 --> 00:22:49,360 this by simply ignoring all of those zones and  just drawing a single zone over the entire table.   241 00:22:49,360 --> 00:22:54,720 And if I hit “T” on my keyboard, that will change  the zone type to Table. And I can now open up the   242 00:22:54,720 --> 00:23:01,120 Table Editor. So the Table Editor is going to  isolate the table for me. And if you notice,   243 00:23:01,120 --> 00:23:05,520 the great green grid lines here, they're kind of  identifying where the rows are. It's trying to   244 00:23:05,520 --> 00:23:10,080 figure out the basic structure of this table.  So this is the initial guess that Equidox has   245 00:23:10,080 --> 00:23:14,960 given me. And it did an okay job. I can always  make manual adjustments to this if I'd like to,   246 00:23:14,960 --> 00:23:21,520 but I prefer to use the Table Detector.  So using our artificial intelligence,   247 00:23:21,520 --> 00:23:26,640 a combination of machine learning and computer  vision, if I move these sliders around,   248 00:23:26,640 --> 00:23:31,520 Equidox is going to trigger that AI to  automatically identify the layout of this table.   249 00:23:32,320 --> 00:23:37,520 Now this table has only column headers and no row  headers, so what I'm going to do is change my row   250 00:23:37,520 --> 00:23:43,760 header from the default one (we have a default one  to one ratio, meaning that you have your top row   251 00:23:43,760 --> 00:23:48,160 will be your column headers and the farthest left  column will be your row headers). In this case,   252 00:23:48,160 --> 00:23:52,240 I'm going to change this, instead of these being  row headers, I'm going to just change them to   253 00:23:52,240 --> 00:23:57,760 table data. So if I just change my one to zero,  that will update my table summary automatically,   254 00:23:57,760 --> 00:24:02,080 which is another thing that Equidox can automate  for you. Automatically putting together your   255 00:24:02,080 --> 00:24:08,080 table summaries based on how you have structured  your table. And the green grid lines to locate   256 00:24:08,080 --> 00:24:13,280 the different cells. So after doing that, if I  take a look at this preview of just the table,   257 00:24:13,920 --> 00:24:19,520 we'll see that before I had a very strange sort of  like series of paragraphs like in run-on sentences   258 00:24:19,520 --> 00:24:23,840 trying to read through this table. But I've  actually very quickly just converted it into   259 00:24:23,840 --> 00:24:28,720 a nice clean table. And again the beauty  of Equidox is that it will take this table   260 00:24:28,720 --> 00:24:33,760 that I've built here, and I can see in my HTML  Preview, and it is automatically going to build   261 00:24:33,760 --> 00:24:42,400 the table tag structure for me when I export the  document as a PDF. So if I save this table now and   262 00:24:42,400 --> 00:24:47,840 close out, I will return back to the Page Editor.  And all of those zones that were underneath   263 00:24:47,840 --> 00:24:52,000 the individual text zones, that were underneath,  they have been overwritten. I'm just now left   264 00:24:52,000 --> 00:24:58,160 with one single table zone. Now if I take a  look at the HTML Preview for the full page,   265 00:24:58,160 --> 00:25:04,240 what I'll see is things are looking a lot better.  But I do have some reading order issues. If you   266 00:25:04,240 --> 00:25:10,160 remember, I created this zone manually. This was  the OCR zone. And when I created it manually,   267 00:25:10,160 --> 00:25:14,960 it took over the lowest available number in  the reading zone. So it jumped up to number one   268 00:25:14,960 --> 00:25:19,520 because I deleted the zone that was capturing this  little thing up here in the header telling me what   269 00:25:19,520 --> 00:25:25,200 version of this document this is. So to fix the  reading order, it's really straightforward. I can   270 00:25:25,200 --> 00:25:30,400 simply go to the Page Tab and press the “Reorder”  button. And that is going to reorder things in a   271 00:25:30,400 --> 00:25:35,280 top-to-bottom, left-to-right sequence. So the  first thing that will be read is my image here   272 00:25:35,280 --> 00:25:40,800 because this is the tallest zone… Then I have the  Joe Smith and the Bob Jones contact information,   273 00:25:40,800 --> 00:25:46,320 my heading 1, and working down the page just as  a top-to-bottom natural layout. If I go back to   274 00:25:46,320 --> 00:25:50,320 the HTML Preview, this will be a little bit  more reflective of what I would like to see.   275 00:25:50,320 --> 00:25:55,040 Where I have down here... I've got my list and  my tables, everything kind of in the order that I   276 00:25:55,040 --> 00:26:00,720 would expect to see it. And one other little thing  that you might choose to adjust... best practice   277 00:26:00,720 --> 00:26:05,440 would call to have your heading level 1 be the  first thing read on the page whenever possible.   278 00:26:05,440 --> 00:26:09,840 So in this case, just to show you another  way to impact the reading order… This is   279 00:26:09,840 --> 00:26:14,960 currently reading zone four. So if I wanted  to make this the first zone read on the page,   280 00:26:14,960 --> 00:26:19,520 you might think that I would then need to  make turn one into two, and two into three,   281 00:26:19,520 --> 00:26:23,920 and three into four… Just so that I can make room  for this. But we can actually use the decimal   282 00:26:23,920 --> 00:26:28,560 system. So if I just put a period over here in  the little order field... if I put a period in   283 00:26:28,560 --> 00:26:32,880 front of the 4 that will make that zone 0.4.  And that is going to reorder it and put it at   284 00:26:32,880 --> 00:26:37,760 the very top of the reading order because 0.4 is  of course less than 1. So I don't have to have a   285 00:26:37,760 --> 00:26:42,560 perfect sequence of numbers. I'm able to impact  the reading order of this page very quickly.   286 00:26:42,560 --> 00:26:48,240 And now if I take a look at just the full sort  of final HTML version of this document, I'm very   287 00:26:48,240 --> 00:26:53,600 satisfied with what I see. Everything here is  basically exactly what I would expect and how I   288 00:26:53,600 --> 00:26:58,720 would anticipate a screen reader should be reading  through this page. So I'm done with this document,   289 00:26:58,720 --> 00:27:05,120 and I can save it and I can close out at any time.  And once I'm done closing out, I could of course   290 00:27:05,120 --> 00:27:10,800 export the document as a PDF simply by going  to the Output Tab and hitting “Generate PDF.”   291 00:27:12,000 --> 00:27:15,840 Now in the interest of time let's jump  back into the slide presentation here. 292 00:27:19,920 --> 00:27:23,920 So we've covered the Equidox demonstration,  and again if there's anything else that you'd   293 00:27:23,920 --> 00:27:30,560 like to see... if you'd like to have a more like  one-on-one type of demonstration where we take a   294 00:27:30,560 --> 00:27:35,440 look at your own documents, your organization's  documents, or talk about the existing workflows   295 00:27:35,440 --> 00:27:39,760 that you have, please feel free to reach out to  us through our contact information which I'll   296 00:27:39,760 --> 00:27:45,040 definitely put up again on the screen. But it's  definitely something that we would love to chat   297 00:27:45,600 --> 00:27:49,600 with different organizations about. We'd love to  know how you're using PDFs, how you're currently   298 00:27:49,600 --> 00:27:53,760 remediating, or if you're not remediating at  all. And if you'd like to incorporate a tool   299 00:27:53,760 --> 00:28:01,200 like Equidox into your workflow. So just in  full disclosure, we are, of course, just to   300 00:28:01,200 --> 00:28:06,880 keep things kind of simplified, we are of course  just using screenshots. But this is a screenshot   301 00:28:06,880 --> 00:28:14,160 of the accessibility report after we have  remediated that document. So just to be honest,   302 00:28:14,160 --> 00:28:19,200 we're not actually opening up Adobe here on the  slide presentation. We just took a screenshot of   303 00:28:19,200 --> 00:28:23,920 another version of this one that we did prior to  the demonstration today. But if you take a look   304 00:28:23,920 --> 00:28:29,360 here at the Tag Panel, now on the left-hand side..  This is after running the accessibility checker,   305 00:28:29,360 --> 00:28:34,720 and there are only two issues located. But keep  in mind that these issues will be present in every   306 00:28:34,720 --> 00:28:41,360 document that you ever remediate and ever attempt  to validate in Acrobat because Acrobat is not   307 00:28:41,360 --> 00:28:47,200 capable of checking for the logical reading order  or the color contrast of the document. So these   308 00:28:47,200 --> 00:28:52,800 are really just sort of warnings or reminders  that those types of elements need manual checks.   309 00:28:52,800 --> 00:28:56,160 But we have already kind of handled that,  especially from the reading order standpoint,   310 00:28:56,800 --> 00:29:01,920 by working through the document in Equidox and  checking things with our HTML Preview. For the   311 00:29:01,920 --> 00:29:06,640 color contrast, you can always use a third-party  color contrast-specific checking tool.   312 00:29:07,440 --> 00:29:11,120 But these are things that are always going to  be present. But if you can tell now you have   313 00:29:11,120 --> 00:29:16,480 a very clean-looking accessibility report  and this document is now perfectly usable,   314 00:29:16,480 --> 00:29:20,160 perfectly accessible, and ready  to be posted or distributed to   315 00:29:21,680 --> 00:29:29,120 your website, or any of your clients, or wherever  it may be off to. So the just a quick reminder,   316 00:29:29,120 --> 00:29:33,440 a couple of the things that we covered in the  Equidox demonstration. We went through a couple   317 00:29:33,440 --> 00:29:39,440 of our Smart Detection Tools. One click to set  our heading levels, we were able to very quickly   318 00:29:39,440 --> 00:29:43,840 adjust our reading order, so there's a number  of things that we'd love to chat with you   319 00:29:43,840 --> 00:29:47,920 more on a one-on-one basis. If you'd like to  have your own personal demonstration of Equidox,   320 00:29:47,920 --> 00:29:51,680 again please get in touch with us. And you  can get in touch with us by reaching out to   321 00:29:51,680 --> 00:29:59,200 EquidoxSales@Onixnet.com. Check out our website  Equidox.co and of course, we are always active   322 00:29:59,920 --> 00:30:06,400 on social media. So please feel free to connect  with us there. So it is 2:30 on the dot. So I   323 00:30:06,400 --> 00:30:10,320 want to be respectful of everyone's time. I  really appreciate everyone attending today,   324 00:30:10,320 --> 00:30:21,840 and come visit us again next month for our  next edition of Equidox Webinar Wednesdays.