1
00:00:00,250 --> 00:00:09,930
[Dan Tuleta] All right, well welcome everyone
to another edition of Equidox Webinar Wednesdays.

2
00:00:09,930 --> 00:00:13,530
It's just about two o'clock, so I think we
should get started.

3
00:00:13,530 --> 00:00:18,100
Now for anyone that has joined us before,
we really appreciate you continuing to attend

4
00:00:18,100 --> 00:00:23,599
these webinars to learn a bit more about document
remediation and Equidox.

5
00:00:23,599 --> 00:00:30,430
So today's presentation is going to be talking
about refining and designing a workflow for

6
00:00:30,430 --> 00:00:33,930
attacking these documents from a remediation
standpoint.

7
00:00:33,930 --> 00:00:43,320
So as always please feel free to reach out
to us at any given time through Our website

8
00:00:43,320 --> 00:00:47,110
is www.Equidox.co.

9
00:00:47,110 --> 00:00:52,390
So we do encourage follow-up questions or
feedback for these webinars and if you'd like

10
00:00:52,390 --> 00:00:57,230
to see any personalized demonstrations of
Equidox, maybe more tailored to your specific

11
00:00:57,230 --> 00:01:00,930
workflow, please feel free to reach out to
us.

12
00:01:00,930 --> 00:01:05,460
We would love to start a conversation with
you.

13
00:01:05,460 --> 00:01:12,330
Now in terms of document remediation, if you
are assigned with remediating documents…

14
00:01:12,330 --> 00:01:17,460
if this is your everyday job… or if this
is just a one-off project, maybe that that

15
00:01:17,460 --> 00:01:23,979
you've been asked to complete as a content
creator, there's a few things to consider

16
00:01:23,979 --> 00:01:25,380
in the pre-flight process.

17
00:01:25,380 --> 00:01:30,830
So before you get started with remediating
a document, one of the first things that I

18
00:01:30,830 --> 00:01:36,570
like to look at is how many pages are there,
If there are just one or a couple of pages

19
00:01:36,570 --> 00:01:42,760
in this document, you know, then then you
can kind of just power through it and just

20
00:01:42,760 --> 00:01:47,000
kind of approach it by just opening the document
up and taking a look at what you have and

21
00:01:47,000 --> 00:01:48,210
start working on it.

22
00:01:48,210 --> 00:01:55,479
Now if there are a lot of pages in this document,
typically I like to take a few minutes to

23
00:01:55,479 --> 00:01:58,149
just investigate what the document is compiled
of.

24
00:01:58,149 --> 00:02:03,070
Because you might choose to be a little bit
more strategic and tactical in how you approach

25
00:02:03,070 --> 00:02:04,549
that document.

26
00:02:04,549 --> 00:02:08,091
Because there could be a number of things
that you could consolidate…

27
00:02:08,091 --> 00:02:13,720
steps along the way that will eliminate redundancies
and, you know, forcing you to repeat steps

28
00:02:13,720 --> 00:02:15,480
page after page.

29
00:02:15,480 --> 00:02:20,410
So we'll take a look at a few of those things
later on during the demonstration.

30
00:02:20,410 --> 00:02:24,330
Now another thing to consider is how complex
is the content?

31
00:02:24,330 --> 00:02:31,270
Now a document might have a lot of pages,
but if you consider a document maybe that's

32
00:02:31,270 --> 00:02:36,360
just all text, for example, where you just
have a series of paragraphs page after page

33
00:02:36,360 --> 00:02:37,630
after page…

34
00:02:37,630 --> 00:02:41,840
The great thing about Equidox is its ability
to auto-detect that type of content.

35
00:02:41,840 --> 00:02:45,220
And it's going to do a lot of the heavy lifting
for you.

36
00:02:45,220 --> 00:02:50,900
So the complexity of the content does have
a lot to do with your workflow as you remediate.

37
00:02:50,900 --> 00:02:57,581
Now a 100-page document made up entirely of
text elements could be easier than a 10-page

38
00:02:57,581 --> 00:02:59,970
document although it's much longer.

39
00:02:59,970 --> 00:03:06,659
A 10-page document might have very technical
engineering drawings, or very complex elements

40
00:03:06,659 --> 00:03:11,160
that take more actual work
to tag properly.

41
00:03:11,160 --> 00:03:14,420
So something to look at is the complexity
of the content.

42
00:03:14,420 --> 00:03:18,810
To kind of build out the workflow in your
head before you get started.

43
00:03:18,810 --> 00:03:23,560
Another thing that I look at is, is the design
and the formatting of the document consistent

44
00:03:23,560 --> 00:03:24,560
throughout?

45
00:03:24,560 --> 00:03:27,690
So is this document coming from a template?

46
00:03:27,690 --> 00:03:32,230
Is there a consistent use of heading structure
throughout?

47
00:03:32,230 --> 00:03:35,599
Are the lists properly formatted throughout
the document?

48
00:03:35,599 --> 00:03:38,840
How consistent is this page after page?

49
00:03:38,840 --> 00:03:42,260
Sometimes I see documents which are almost
like a Frankenstein document.

50
00:03:42,260 --> 00:03:49,400
Where it's a series of pages that have come
together from various locations.

51
00:03:49,400 --> 00:03:51,049
There's nothing consistent about them.

52
00:03:51,049 --> 00:03:56,940
Every page is unique, and every page has to
be treated like as its own individual puzzle.

53
00:03:56,940 --> 00:04:02,800
So it's just important to note if there's
consistent formatting throughout the document.

54
00:04:02,800 --> 00:04:08,579
There are ways within Equidox to consolidate
steps and to eliminate some redundancies so

55
00:04:08,579 --> 00:04:12,409
that you don't have to repeat the same exact
steps page after page.

56
00:04:12,409 --> 00:04:18,549
Now another thing that I look at always, is
if there is existing tag structure associated

57
00:04:18,549 --> 00:04:20,320
with the document.

58
00:04:20,320 --> 00:04:23,850
And existing tag structure could be good or
bad.

59
00:04:23,850 --> 00:04:28,880
Now many documents, depending on where they
come from… for example, Microsoft Word being

60
00:04:28,880 --> 00:04:34,380
a very common one, when you hit “Save as
PDF” coming out of Microsoft Word oftentimes

61
00:04:34,380 --> 00:04:40,400
there is an automated tagging process where
Microsoft Word will attempt to convert that

62
00:04:40,400 --> 00:04:47,699
word file into a PDF and generate some automated
tags throughout that process.

63
00:04:47,699 --> 00:04:52,860
Now that can be good or bad, depending on
the way that the document was put together.

64
00:04:52,860 --> 00:04:58,550
But the point is that you don't really know
what you're going to get during that conversion

65
00:04:58,550 --> 00:05:01,930
process from Microsoft Word into PDF.

66
00:05:01,930 --> 00:05:07,720
Because how those how those tags are generated
is basically a mystery to the user.

67
00:05:07,720 --> 00:05:10,270
You just have to kind of wait and see what
you get.

68
00:05:10,270 --> 00:05:13,610
There are of course many things that you can
take into consideration when you're designing

69
00:05:13,610 --> 00:05:20,491
your documents to improve the accessibility
of them when you do convert them into a PDF.

70
00:05:20,491 --> 00:05:25,730
But they still need to be looked at because
PDF and Word are different formats and they

71
00:05:25,730 --> 00:05:30,520
just have different ways of marking up documents
from an accessibility standpoint.

72
00:05:30,520 --> 00:05:34,919
So just because your Microsoft Word document
is technically accessible, it does not mean

73
00:05:34,919 --> 00:05:38,500
that the PDF that that generates is going
to be automatically accessible.

74
00:05:38,500 --> 00:05:44,050
So you still need to take into consideration
that the existing tags that were generated

75
00:05:44,050 --> 00:05:49,660
automatically through Microsoft Word, they
might still need to be touched up or adjusted.

76
00:05:49,660 --> 00:05:56,850
Now there are other documents that a that
may have been remediated at some point in

77
00:05:56,850 --> 00:06:03,380
their life cycle where someone might have
opened the document in Adobe Acrobat and pressed

78
00:06:03,380 --> 00:06:08,610
auto tag and just kind of crossed their fingers
and hoped that they got some sort of usable

79
00:06:08,610 --> 00:06:10,520
tag structure.

80
00:06:10,520 --> 00:06:16,199
This can be something that is going to cause
you more problems than be of service.

81
00:06:16,199 --> 00:06:17,729
So these are just things to look at.

82
00:06:17,729 --> 00:06:20,389
If you have existing
tag structure.

83
00:06:20,389 --> 00:06:25,380
What is the quality of those existing tags,
and are they worth maintaining, or is it better

84
00:06:25,380 --> 00:06:28,800
to just use Equidox and start from scratch?

85
00:06:28,800 --> 00:06:34,420
Because of the automation and all of the tools
that Equidox gives you to tag documents, there

86
00:06:34,420 --> 00:06:41,099
are many instances where it can be easier
to just start over rather than trying to rescue

87
00:06:41,099 --> 00:06:45,480
poor existing tags.

88
00:06:45,480 --> 00:06:49,509
Another feature that or another aspect that
I like to look at is does the document require

89
00:06:49,509 --> 00:06:50,509
OCR?

90
00:06:50,509 --> 00:06:53,630
And OCR stands for Optical Character Recognition.

91
00:06:53,630 --> 00:06:59,289
This can be relevant for documents that are
scanned, for example, or documents that might

92
00:06:59,289 --> 00:07:05,240
contain infographics that have text that is
actually locked inside of images.

93
00:07:05,240 --> 00:07:11,770
Now OCR certainly adds another layer of complexity
to remediation because in order to make these

94
00:07:11,770 --> 00:07:17,750
documents compatible with screen readers and
machine-readable, an image is simply an image.

95
00:07:17,750 --> 00:07:21,940
We need to extract text from it if there is
in fact text inside of it.

96
00:07:21,940 --> 00:07:28,050
So if you have a document where someone took
an old newspaper and ran it through a scanner,

97
00:07:28,050 --> 00:07:31,940
technically that document can come out of
that scanner as a PDF.

98
00:07:31,940 --> 00:07:36,250
But it is simply just an image as far as a
machine reader is concerned.

99
00:07:36,250 --> 00:07:41,199
So that this requires an extra step called
Optical Character Recognition, where you're

100
00:07:41,199 --> 00:07:47,940
actually analyzing the image and extracting
the usable searchable, selectable text from

101
00:07:47,940 --> 00:07:48,940
it.

102
00:07:48,940 --> 00:07:54,389
So you have to run this OCR process to extract
that text, which can take it can take a little

103
00:07:54,389 --> 00:07:58,720
bit of extra time, but this is just sort of
the nature of the of the format.

104
00:07:58,720 --> 00:08:03,610
If you're dealing with scanned images then
you have to take the time to make sure that

105
00:08:03,610 --> 00:08:08,690
you are extracting the content from it in
order to make it machine readable.

106
00:08:08,690 --> 00:08:11,680
Another thing to consider is are there form
fields present?

107
00:08:11,680 --> 00:08:17,500
So documents PDF documents that are intended
to be fillable forms oftentimes will have

108
00:08:17,500 --> 00:08:19,280
form fields present within them.

109
00:08:19,280 --> 00:08:26,270
And this is another element that does take
a little bit of extra care because form fields

110
00:08:26,270 --> 00:08:28,819
need to have what are called Tooltips applied
to them.

111
00:08:28,819 --> 00:08:32,190
So if you find that there are form fields
in the document, that's another thing to be

112
00:08:32,190 --> 00:08:36,690
aware of and another thing to sort of budget
your time and your workflow around… making

113
00:08:36,690 --> 00:08:42,800
sure that you that you take the time to properly
handle those form fields.

114
00:08:42,800 --> 00:08:46,700
And then one of the last things that I like
to look at are the images.

115
00:08:46,700 --> 00:08:51,399
So if there are images present in the document,
which many times there will be, how many images

116
00:08:51,399 --> 00:08:52,420
are there?

117
00:08:52,420 --> 00:08:54,260
About how many images per page are there?

118
00:08:54,260 --> 00:09:00,279
I mean, are these images informative are they
technical or are they something that are just

119
00:09:00,279 --> 00:09:01,279
decorative?

120
00:09:01,279 --> 00:09:04,650
Now decorative images could be things like
repetitive logos…

121
00:09:04,650 --> 00:09:09,050
they could be like watermark… or background
images…

122
00:09:09,050 --> 00:09:11,160
they could be stock images.

123
00:09:11,160 --> 00:09:19,779
So some sort of brochure where you just have
two people bending over a computer smiling

124
00:09:19,779 --> 00:09:21,660
at the computer screen…

125
00:09:21,660 --> 00:09:26,410
These are sorts of like stock images that
are not actually adding any content a contextual

126
00:09:26,410 --> 00:09:29,050
value to the document itself.

127
00:09:29,050 --> 00:09:33,269
They're simply there for the aesthetics and
to take up some space and maybe balance out

128
00:09:33,269 --> 00:09:34,610
the page.

129
00:09:34,610 --> 00:09:40,880
So not all images require alt text, but that's
up to the remediator to determine…

130
00:09:40,880 --> 00:09:45,300
If there are images, what kind of alt text
and what kind of effort do you have to make

131
00:09:45,300 --> 00:09:47,530
to apply that alt text?

132
00:09:47,530 --> 00:09:51,410
Perhaps the image… the document is full
of technical images.

133
00:09:51,410 --> 00:09:56,640
Maybe this is like engineering content where
you have a bunch of complex diagrams and graphs

134
00:09:56,640 --> 00:09:57,790
and charts.

135
00:09:57,790 --> 00:10:01,410
If you're just the remediator and you're not
the subject matter expert, you might need

136
00:10:01,410 --> 00:10:06,220
to incorporate someone who knows more about
that subject than you to help you with the

137
00:10:06,220 --> 00:10:07,270
alt text writing.

138
00:10:07,270 --> 00:10:13,000
So these are just things to consider as you
are going through your remediation and as

139
00:10:13,000 --> 00:10:18,380
you're developing your workflow throughout
that document.

140
00:10:18,380 --> 00:10:21,140
Now some collaboration ideas!

141
00:10:21,140 --> 00:10:26,269
Because Equidox is a web-based application,
there's a collaboration aspect built into

142
00:10:26,269 --> 00:10:27,790
the tool.

143
00:10:27,790 --> 00:10:33,980
So we can actually work together and share
documents through the Equidox platform.

144
00:10:33,980 --> 00:10:39,040
And because we work on a concurrent user licensing
model, that means you can have multiple people

145
00:10:39,040 --> 00:10:43,769
logged into the account, into the same account
simultaneously, and you can even have multiple

146
00:10:43,769 --> 00:10:48,540
people working together on the same document
simultaneously.

147
00:10:48,540 --> 00:10:53,910
So Equidox gives you the ability to share
documents with other users or groups and you

148
00:10:53,910 --> 00:10:55,589
can actually divide and conquer.

149
00:10:55,589 --> 00:11:00,269
So let's say you have a 200-page document
and it's got a very tight deadline.

150
00:11:00,269 --> 00:11:03,209
You know, you'd like to get it posted by the
end of the day.

151
00:11:03,209 --> 00:11:08,320
Well rather than one person having to hunch
over their computer all day and kind of panic

152
00:11:08,320 --> 00:11:12,950
about this very large dense document, you
might have five different users and everyone

153
00:11:12,950 --> 00:11:14,480
takes 40 pages.

154
00:11:14,480 --> 00:11:18,540
And then you can kind of power through that
document in an hour or two, rather than having

155
00:11:18,540 --> 00:11:22,750
one person have to stress out and clear their
calendar to make sure that they can get through

156
00:11:22,750 --> 00:11:23,750
it.

157
00:11:23,750 --> 00:11:26,829
So this is all made possible because this
is a cloud-based application.

158
00:11:26,829 --> 00:11:32,250
Being able to share the document and even
have multiple users working together simultaneously

159
00:11:32,250 --> 00:11:36,220
has a lot of advantages built into it.

160
00:11:36,220 --> 00:11:40,850
As I mentioned before, images could require
subject matter experts so depending on your

161
00:11:40,850 --> 00:11:47,950
background or the nature of the document these
images that are contained in the document

162
00:11:47,950 --> 00:11:53,190
might be so complex that you're not exactly
sure how to provide an accurate alt description.

163
00:11:53,190 --> 00:11:58,630
So that again, the advantage of being web-based,
is you could send the URL for the page over

164
00:11:58,630 --> 00:12:02,802
through an instant messenger or through an
email to one of your colleagues, or one of

165
00:12:02,802 --> 00:12:07,590
your teammates who might be more familiar
with the content, and just ask them to take

166
00:12:07,590 --> 00:12:11,089
a few minutes to provide alt text for those
images.

167
00:12:11,089 --> 00:12:13,660
And then you can do the rest of the tagging
of the documents.

168
00:12:13,660 --> 00:12:17,350
So from a structural standpoint, you can set
your reading order, and your headings, and

169
00:12:17,350 --> 00:12:22,079
your lists and your tables, but those images
that require that alternate text you might

170
00:12:22,079 --> 00:12:24,300
not be comfortable trying to provide that…

171
00:12:24,300 --> 00:12:27,100
If they are very technical or very complex
images…

172
00:12:27,100 --> 00:12:30,170
so you can get someone else involved in that
workflow.

173
00:12:30,170 --> 00:12:33,480
Because this is cloud-based, they can be working
on that alt text while you are working on

174
00:12:33,480 --> 00:12:36,370
the rest of the document.

175
00:12:36,370 --> 00:12:42,380
And also in the Equidox application, there
is a validation tool, and a Page Notes feature,

176
00:12:42,380 --> 00:12:47,000
so assuming that you're working together with
a large team on a specific document, you can

177
00:12:47,000 --> 00:12:51,480
use the validation tool to essentially mark
pages as complete.

178
00:12:51,480 --> 00:12:53,570
So that you can avoid redundancies…

179
00:12:53,570 --> 00:12:57,750
so someone isn't coming into a page and making
adjustments and changing things that have

180
00:12:57,750 --> 00:13:00,880
already been marked as validated.

181
00:13:00,880 --> 00:13:06,690
And then also the Page Notes feature is a
way of just leaving yourself or other users

182
00:13:06,690 --> 00:13:07,690
a note.

183
00:13:07,690 --> 00:13:13,449
Perhaps you skipped over an element because
you weren't sure how to approach it, or you'd

184
00:13:13,449 --> 00:13:18,820
like to just have some sort of communication
with your colleagues as to what happened on

185
00:13:18,820 --> 00:13:22,080
that specific page, or how you decided to
approach it.

186
00:13:22,080 --> 00:13:28,360
Where they might be able to sort of replicate
that process on other similar pages.

187
00:13:28,360 --> 00:13:32,501
And Equidox will give you a consolidated list
of all of these various page notes that you

188
00:13:32,501 --> 00:13:34,850
take throughout the throughout the document.

189
00:13:34,850 --> 00:13:40,060
And whether you're an Administrator or a power
user, and you have less skilled less trained

190
00:13:40,060 --> 00:13:44,410
workers working underneath you, you can very
quickly look through the page notes and make

191
00:13:44,410 --> 00:13:46,260
sure that everything has been addressed.

192
00:13:46,260 --> 00:13:50,329
If there were questions, or if there were
issues, you can always just follow up in the

193
00:13:50,329 --> 00:13:55,650
in the application and look at that specific
page and make sure that that issue was addressed.

194
00:13:55,650 --> 00:13:59,710
And we'll go through that as well in the demonstration.

195
00:13:59,710 --> 00:14:01,459
Now eliminating repetition.

196
00:14:01,459 --> 00:14:08,220
So, if we have a lot of different tools built
into Equidox that allow you to programmatically

197
00:14:08,220 --> 00:14:10,259
apply things to all pages.

198
00:14:10,259 --> 00:14:15,089
So, if you are an existing Equidox user or
perhaps you've seen a demonstration before,

199
00:14:15,089 --> 00:14:18,750
you're probably familiar with our Zone detection
slider.

200
00:14:18,750 --> 00:14:24,190
So this is a tool that will redefine or detect
the reading zones within a document.

201
00:14:24,190 --> 00:14:28,470
And if you find that you have a pretty consistent
formatting throughout the entire document,

202
00:14:28,470 --> 00:14:31,209
you can apply this to all pages.

203
00:14:31,209 --> 00:14:33,649
Now the same thing can be done for the reading
order.

204
00:14:33,649 --> 00:14:39,220
So if you have, for example, every page in
the document is a two-column reading order,

205
00:14:39,220 --> 00:14:43,380
you can choose two columns as that reading
order and hit “Apply to all pages” and

206
00:14:43,380 --> 00:14:49,290
then Equidox will programmatically set that
two column layouts to every one of those pages.

207
00:14:49,290 --> 00:14:51,550
We also have the ability to set up a heading
template.

208
00:14:51,550 --> 00:14:58,110
So again, assuming that the document is consistently
formatted and designed throughout, you can

209
00:14:58,110 --> 00:15:00,899
actually identify your heading hierarchy.

210
00:15:00,899 --> 00:15:06,500
So for example if you find that your heading
level twos are all in the same font style,

211
00:15:06,500 --> 00:15:11,670
the same font characteristics, you can hit
a simple checkbox and Equidox will programmatically

212
00:15:11,670 --> 00:15:17,089
look forward in the document and identify
all of the different areas of the document

213
00:15:17,089 --> 00:15:21,461
where you have that same font style and will
automatically apply that heading level two

214
00:15:21,461 --> 00:15:22,461
for you.

215
00:15:22,461 --> 00:15:26,080
And that can be done for all of your various
heading levels.

216
00:15:26,080 --> 00:15:28,150
We also have an “Ignore” feature.

217
00:15:28,150 --> 00:15:34,600
The Ignore feature is the ability to bulk
artifact repetitive elements in a document.

218
00:15:34,600 --> 00:15:41,230
So if you have a footer or a header (maybe
it's like a revision date or a serial number,

219
00:15:41,230 --> 00:15:45,910
something that is just continuously repeating
on every single page) you don't necessarily

220
00:15:45,910 --> 00:15:48,510
need to tag that on every page.

221
00:15:48,510 --> 00:15:54,160
And you can use the ignore feature, if it's
located in the same location, to bulk artifacts

222
00:15:54,160 --> 00:15:57,029
all of that information throughout the entire
document.

223
00:15:57,029 --> 00:16:01,500
So that you're not feeding this redundant
information to the screen reader user page

224
00:16:01,500 --> 00:16:04,850
after page after page.

225
00:16:04,850 --> 00:16:07,720
Now another important part of this is to keep
a rhythm.

226
00:16:07,720 --> 00:16:13,460
So, I find when I’m working on longer documents
particularly, that remediation can be a very

227
00:16:13,460 --> 00:16:15,459
rhythmic process.

228
00:16:15,459 --> 00:16:20,860
So if you have similar layouts and designs
throughout the document, a lot of those pages

229
00:16:20,860 --> 00:16:23,720
will require the same techniques.

230
00:16:23,720 --> 00:16:29,220
And once you get through a handful of pages,
you will start to eliminate redundancies and

231
00:16:29,220 --> 00:16:31,030
consolidate steps.

232
00:16:31,030 --> 00:16:34,709
So that you can start just eliminating, literally
eliminating clicks.

233
00:16:34,709 --> 00:16:39,050
And if you're if you're doing a high volume
type of project where you have a lot of pages

234
00:16:39,050 --> 00:16:44,800
to get through, if you can eliminate a handful
of clicks or a handful of steps on every page,

235
00:16:44,800 --> 00:16:48,829
that really adds up to a lot of time savings
at the end of that document, or the end of

236
00:16:48,829 --> 00:16:49,829
that project.

237
00:16:49,829 --> 00:16:57,300
So, as you get into a rhythm, I find it's
very helpful to try to stay within that rhythm.

238
00:16:57,300 --> 00:17:03,540
And then the outlier type of pages, perhaps
you come across a page where you have a completely

239
00:17:03,540 --> 00:17:08,339
different layout, or there's a large infographic
or a flow chart or something that is completely

240
00:17:08,339 --> 00:17:12,459
different than the pages that you've been
previously working, on you can always skip

241
00:17:12,459 --> 00:17:16,590
over that page and just return to it at the
very end of the process.

242
00:17:16,590 --> 00:17:21,730
So rather than disrupting that flow that you've
just developed for the previous 10 or 20 pages,

243
00:17:21,730 --> 00:17:27,110
you can just skip over it and go back to that
flow chart or back to that infographic that

244
00:17:27,110 --> 00:17:29,530
requires a lot of extra work.

245
00:17:29,530 --> 00:17:33,890
And then you can kind of finish off the document
at the very end by addressing that outlier

246
00:17:33,890 --> 00:17:38,400
type of page at the very end of the workflow.

247
00:17:38,400 --> 00:17:41,720
Now another part of Equidox is validating
as you go.

248
00:17:41,720 --> 00:17:46,179
And this is something that we will also cover
in the in the demonstration when we get to

249
00:17:46,179 --> 00:17:47,179
it.

250
00:17:47,179 --> 00:17:52,140
So the HTML preview, if you've used Equidox
before I’m sure you're familiar with this,

251
00:17:52,140 --> 00:17:58,299
but the HTML preview is a way of checking
the structure that you've built on the page

252
00:17:58,299 --> 00:18:02,270
in terms of reading order, your headings,
lists, tables.

253
00:18:02,270 --> 00:18:07,740
All of that information that you have structured
in that remediation process will be previewable

254
00:18:07,740 --> 00:18:09,930
in the HTML preview page.

255
00:18:09,930 --> 00:18:15,309
And this is a way of just checking your work
before moving on to a subsequent page.

256
00:18:15,309 --> 00:18:19,340
So, you'll get a lot of feedback from the
application as you're going through at the

257
00:18:19,340 --> 00:18:20,580
page level.

258
00:18:20,580 --> 00:18:25,150
And if you're struggling with a certain page,
you can always revisit it and finish it off

259
00:18:25,150 --> 00:18:26,150
at the very end.

260
00:18:26,150 --> 00:18:29,440
So, it's just a way of checking your work
as you go.

261
00:18:29,440 --> 00:18:35,860
Also when you are finished working through
a document, when you export the PDF, Equidox

262
00:18:35,860 --> 00:18:40,370
will run a series of accessibility checks
on that export.

263
00:18:40,370 --> 00:18:42,360
Now the bullet points here on the slide…

264
00:18:42,360 --> 00:18:47,280
this is not a comprehensive list of everything
that it is checking for, but just a few examples

265
00:18:47,280 --> 00:18:52,049
of the things that it's looking for are missing
alt text for images… if you have illogical

266
00:18:52,049 --> 00:18:55,030
reading orders or logical heading structures…

267
00:18:55,030 --> 00:18:59,850
invalid merges… for example, if you are
trying to merge a list with a text zone, for

268
00:18:59,850 --> 00:19:01,549
example…

269
00:19:01,549 --> 00:19:07,220
And then the errors that Equidox finds throughout
this accessibility check, they are directly

270
00:19:07,220 --> 00:19:10,220
linked to the specific page where it was found
on.

271
00:19:10,220 --> 00:19:13,570
And they give you a clickable link where you
can just select the link and go straight to

272
00:19:13,570 --> 00:19:15,720
the page where that error was found.

273
00:19:15,720 --> 00:19:20,340
And then you can make your correction in Equidox
and then just re-export the document once

274
00:19:20,340 --> 00:19:23,090
you've worked through all of those errors.

275
00:19:23,090 --> 00:19:31,440
So this accessibility check that we are applying
on export, as I said it's not a fully comprehensive

276
00:19:31,440 --> 00:19:34,620
accessibility report that you are getting…

277
00:19:34,620 --> 00:19:40,770
We still highly recommend using third-party
validation tools and especially screen readers.

278
00:19:40,770 --> 00:19:45,850
Screen readers to replicate exactly what a
screen reader user is experiencing when they're

279
00:19:45,850 --> 00:19:47,600
working with this document.

280
00:19:47,600 --> 00:19:49,650
Now automated accessibility checking…

281
00:19:49,650 --> 00:19:54,570
it can be automated to a certain degree but
there are many, many different elements that

282
00:19:54,570 --> 00:19:59,929
it can miss because it's simply looking for
technical requirements.

283
00:19:59,929 --> 00:20:04,890
But it's not actually able to replicate what
an end user is going through with a screen

284
00:20:04,890 --> 00:20:05,890
reader.

285
00:20:05,890 --> 00:20:10,020
So it's very important to make sure that you
are that you are checking your work as you

286
00:20:10,020 --> 00:20:11,980
as you export the document.

287
00:20:11,980 --> 00:20:17,110
Putting it through various third-party tools,
or using screen readers which is the best

288
00:20:17,110 --> 00:20:22,420
and the ultimate check for the actual usability
of that document.

289
00:20:22,420 --> 00:20:28,600
Okay, so let's jump into Equidox now, and
we'll work through a couple of examples here…

290
00:20:28,600 --> 00:20:31,450
and I realize that we’ve got about 10 minutes
left.

291
00:20:31,450 --> 00:20:33,659
So I’m going to go pretty quickly through
these documents.

292
00:20:33,659 --> 00:20:37,570
But that's sort of the point with Equidox,
is to be able to move very efficiently through

293
00:20:37,570 --> 00:20:38,570
them.

294
00:20:38,570 --> 00:20:42,669
So I have a couple of documents pre-loaded
here into Equidox and I want to start with

295
00:20:42,669 --> 00:20:43,710
this one here.

296
00:20:43,710 --> 00:20:46,210
And this is just a one-page document.

297
00:20:46,210 --> 00:20:49,740
So as I said before, when you have a one-page
document, sometimes it's best to just jump

298
00:20:49,740 --> 00:20:55,530
right in and see what you have rather than
trying to spend a few minutes to be very strategic

299
00:20:55,530 --> 00:20:56,840
and tactical with it.

300
00:20:56,840 --> 00:21:01,110
You can just approach this page and just see
what you've got in here, and start working

301
00:21:01,110 --> 00:21:02,110
through it.

302
00:21:02,110 --> 00:21:06,640
Now what I notice right away is I have some
images, I have some headings, I have text,

303
00:21:06,640 --> 00:21:11,720
I have a list here, ( in fact, a nested list),
and a table down here at the very bottom.

304
00:21:11,720 --> 00:21:16,110
So just to quickly work through this, what
I’m going to do is I’m going to start

305
00:21:16,110 --> 00:21:20,140
by, at the top and just start artifacting
images that I don't really need.

306
00:21:20,140 --> 00:21:24,120
I don't know what this is, just a background
sort of watermark type of image…

307
00:21:24,120 --> 00:21:31,880
Here's another image of black and white dog
holding a briefcase…

308
00:21:31,880 --> 00:21:33,799
here's our Equidox logo up here.

309
00:21:33,799 --> 00:21:37,000
So I’m just typing in all descriptions for
them.

310
00:21:37,000 --> 00:21:38,730
Now I’m going to work on my heading structures.

311
00:21:38,730 --> 00:21:39,730
So I hit…

312
00:21:39,730 --> 00:21:43,750
everything right now is currently set as a
text zone because this document was untagged

313
00:21:43,750 --> 00:21:44,830
to begin with.

314
00:21:44,830 --> 00:21:49,410
So if I just start hitting my appropriate
keyboard shortcuts to set my headings.

315
00:21:49,410 --> 00:21:54,510
Heading level 1, heading level 2, heading
level 2, and heading level 2.

316
00:21:54,510 --> 00:21:57,580
Now I’ve defined my headings for this page.

317
00:21:57,580 --> 00:21:59,110
Next up is this list.

318
00:21:59,110 --> 00:22:04,059
As you can tell this is a nested list, and
what I’ll do is I will hit “L” on my

319
00:22:04,059 --> 00:22:06,580
keyboard and I will use my List Detection
slider.

320
00:22:06,580 --> 00:22:10,980
So if I move this over, Equidox is going to
look inside of this list and it's going to

321
00:22:10,980 --> 00:22:14,950
pick up these nested layers inside of the
individual list items.

322
00:22:14,950 --> 00:22:18,929
So if you can tell, or if you've used Equidox
before, you'll notice that this list has been

323
00:22:18,929 --> 00:22:23,510
automatically detected, and the nestings have
been detected inside of those individual list

324
00:22:23,510 --> 00:22:26,380
items, simply by moving this list detection
slider.

325
00:22:26,380 --> 00:22:30,960
So this is using artificial intelligence and
machine learning to better understand list

326
00:22:30,960 --> 00:22:34,710
structures, and something like this only takes
a few seconds.

327
00:22:34,710 --> 00:22:38,590
Now this down here is a table, and if you
can tell I have a bunch of different text

328
00:22:38,590 --> 00:22:43,020
zones covering up this table because it has
no existing tag structure.

329
00:22:43,020 --> 00:22:46,049
So I’m not going to worry too much about
those existing zones.

330
00:22:46,049 --> 00:22:50,159
I’m just going to draw a single zone over
the entire table, hit “T” on my keyboard

331
00:22:50,159 --> 00:22:52,919
and open up the Table Editor.

332
00:22:52,919 --> 00:22:57,640
Once I open up the Table Editor, I’m going
to then use my detection tools here.

333
00:22:57,640 --> 00:23:03,140
So again, the computer vision and machine
learning to redefine the table structure.

334
00:23:03,140 --> 00:23:08,250
So the individual rows and columns and cells
have been all identified through our artificial

335
00:23:08,250 --> 00:23:09,280
intelligence.

336
00:23:09,280 --> 00:23:14,980
And then I can actually use the spanning feature
to span across here for these various years.

337
00:23:14,980 --> 00:23:18,010
I’m just holding Shift and pressing “S”
on my keyboard.

338
00:23:18,010 --> 00:23:21,919
And then this is an example of a table where
I actually have two rows of column headers

339
00:23:21,919 --> 00:23:24,100
so 2017 and 18.

340
00:23:24,100 --> 00:23:29,400
And then the various Q1s through Q4s… these
are also going to be read as column headers.

341
00:23:29,400 --> 00:23:34,340
So, I will just change my column header from
the standard one to two.

342
00:23:34,340 --> 00:23:39,700
And then if I look at my HTML preview of this
table, what I see is a nice clean accessible

343
00:23:39,700 --> 00:23:41,090
HTML table.

344
00:23:41,090 --> 00:23:44,669
And then the great part of Equidox is that
it will take this table and it will automatically

345
00:23:44,669 --> 00:23:48,160
convert it into a PDF tag tree.

346
00:23:48,160 --> 00:23:52,610
Now when I save this table and close out,
all of those individual zones that were previously

347
00:23:52,610 --> 00:23:54,440
there have been artifacted.

348
00:23:54,440 --> 00:23:57,600
So I’m just left with the single Table zone.

349
00:23:57,600 --> 00:24:01,480
So, the last thing I need to do now is just
reorder the page.

350
00:24:01,480 --> 00:24:05,261
Because I’ve removed some zones and I’ve
drawn some zones manually, my reading order

351
00:24:05,261 --> 00:24:06,500
is a little bit out of sync.

352
00:24:06,500 --> 00:24:10,020
So, all I’m going to do is go to the Page
Tab and press “Reorder.”

353
00:24:10,020 --> 00:24:14,080
And I can always check my HTML preview just
to validate that I’m satisfied with everything

354
00:24:14,080 --> 00:24:15,080
that I see.

355
00:24:15,080 --> 00:24:19,450
This all looks pretty good to me, so what
I’ll do is I will just save the page.

356
00:24:19,450 --> 00:24:23,270
I can mark it as validated if I'd like to,
just to remind myself that I’ve already

357
00:24:23,270 --> 00:24:28,820
finished this> And if I go to the Output Tab,
I’m then able to generate the PDF.

358
00:24:28,820 --> 00:24:32,890
Now when I generate the PDF, hopefully I’m
not going to get any errors, and in fact I

359
00:24:32,890 --> 00:24:33,950
did not.

360
00:24:33,950 --> 00:24:38,330
But now the PDF opens up in a separate tab
for me in my browser.

361
00:24:38,330 --> 00:24:41,110
If you can tell, nothing is visually changed
about this document.

362
00:24:41,110 --> 00:24:45,380
It's exactly the same, but all of the tag
structure that I’ve just set up during that

363
00:24:45,380 --> 00:24:52,580
remediation process is going to be automatically
converted into those PDF tags for me.

364
00:24:52,580 --> 00:24:57,200
Now with just a few minutes left, I want to
take a look at a bit of a longer document.

365
00:24:57,200 --> 00:25:01,289
And this is a document here where it's quite
a bit older.

366
00:25:01,289 --> 00:25:07,130
It's been it was created in 2007, and actually
this document has an OCR page.

367
00:25:07,130 --> 00:25:11,330
So this kind of incorporates a few different
elements into it one place.

368
00:25:11,330 --> 00:25:13,960
I like to start, when working with longer
documents…

369
00:25:13,960 --> 00:25:17,530
Of course I’ll take a look at the thumbnails
for each of the pages and just try to get

370
00:25:17,530 --> 00:25:21,620
an understanding for what is this document
actually made up of.

371
00:25:21,620 --> 00:25:24,700
But then one thing I like to look at is the
Images Tab.

372
00:25:24,700 --> 00:25:29,360
Now the Images Tab, I can see here there are
quite a few images in this document.

373
00:25:29,360 --> 00:25:33,570
And they are somewhat technical, because these
are screenshots and sort of instructional

374
00:25:33,570 --> 00:25:34,570
images.

375
00:25:34,570 --> 00:25:38,309
And some of the images already have alt text,
whereas others do not.

376
00:25:38,309 --> 00:25:41,840
So this is something to pay attention to as
you're going through page by page, making

377
00:25:41,840 --> 00:25:47,380
sure that your images have all been described
or in some cases artifacted if you do not

378
00:25:47,380 --> 00:25:50,860
feel that they need to be given a description.

379
00:25:50,860 --> 00:25:54,230
So this is something where I’m not really
able to tell, based on the context of the

380
00:25:54,230 --> 00:25:56,669
document, what this image is trying to represent.

381
00:25:56,669 --> 00:25:59,710
So I don't feel comfortable providing alt
text quite yet.

382
00:25:59,710 --> 00:26:03,870
I’ll wait till I get through the document
and go through page by page.

383
00:26:03,870 --> 00:26:10,159
Since I have OCR here on this very first page,
I’m just going to use my Zone Detector.

384
00:26:10,159 --> 00:26:14,690
This Zone Detection tool is something that
we have not yet demoed, but it's definitely

385
00:26:14,690 --> 00:26:20,510
a huge part of the Equidox system is using
that Zone Detection tool.

386
00:26:20,510 --> 00:26:23,590
And you can redefine the granularity of the
reading zones.

387
00:26:23,590 --> 00:26:27,059
I have this background image, so I’m going
to just get rid of that background image,

388
00:26:27,059 --> 00:26:28,090
and I’m going to OCR all zones.

389
00:26:28,090 --> 00:26:30,450
So, I’m going to convert this scanned page.

390
00:26:30,450 --> 00:26:34,299
If you can tell, this is actually scanned
I’m actually going to convert it now into

391
00:26:34,299 --> 00:26:37,340
selectable searchable usable text.

392
00:26:37,340 --> 00:26:39,950
And once I’ve done that, I can then set
my heading level.

393
00:26:39,950 --> 00:26:44,250
So this is a heading level one, because it's
on the very first page, nice big bold font.

394
00:26:44,250 --> 00:26:46,110
And you can check your HTML preview.

395
00:26:46,110 --> 00:26:51,190
So I’m pretty satisfied with that page I
can then move on to the next.

396
00:26:51,190 --> 00:26:56,380
Now if we had the if we had more time I would
basically just be working through this document

397
00:26:56,380 --> 00:26:57,420
very quickly.

398
00:26:57,420 --> 00:27:03,240
A page like this, where I had existing tag
structure of one big P tag covering the entire

399
00:27:03,240 --> 00:27:07,690
page… this can be very quickly adjusted,
setting my headings and setting my list down

400
00:27:07,690 --> 00:27:08,690
here.

401
00:27:08,690 --> 00:27:12,330
So if I just move my List Detector from left
to right that will define this list with all

402
00:27:12,330 --> 00:27:14,390
the individual elements.

403
00:27:14,390 --> 00:27:20,740
I can check my HTML preview… perhaps I want
to get rid of these headers and footers, for

404
00:27:20,740 --> 00:27:23,460
example, like for the revision date…

405
00:27:23,460 --> 00:27:26,370
This is the type of information that can just
be very redundant…

406
00:27:26,370 --> 00:27:31,670
So, using the Ignore feature as I mentioned
before, if I just draw a zone that's intersecting

407
00:27:31,670 --> 00:27:35,799
with that zone, I press “I” on my keyboard,
that will become an Ignore zone.

408
00:27:35,799 --> 00:27:41,190
And then if I go back to the preview, the
revision date is no longer part of the tagged

409
00:27:41,190 --> 00:27:43,299
elements.

410
00:27:43,299 --> 00:27:49,480
So if I go forward into the document and I
look at other pages, the Ignore zone is already

411
00:27:49,480 --> 00:27:51,260
waiting for me.

412
00:27:51,260 --> 00:27:55,830
So it's already there, ready to remove that
revision date.

413
00:27:55,830 --> 00:28:02,640
Now this is a 14- page document, and working
through it I would spend probably about 30

414
00:28:02,640 --> 00:28:05,340
seconds per page on average to get through
this.

415
00:28:05,340 --> 00:28:10,820
So it would be about a five- to ten-minute
document to get through using some of the

416
00:28:10,820 --> 00:28:14,030
steps that I’ve already explained and described.

417
00:28:14,030 --> 00:28:18,430
You know, the Ignore feature, setting up a
heading template, using the auto detection

418
00:28:18,430 --> 00:28:24,950
feature of the zones, you can very quickly
get through content like this.

419
00:28:24,950 --> 00:28:28,520
Because we've taken the time to be a little
bit more strategic in how we approach it.

420
00:28:28,520 --> 00:28:32,010
So you don't have to remember to remove this
on every page, you don't have to set your

421
00:28:32,010 --> 00:28:35,980
headings on every page, because you can do
it programmatically through the heading template.

422
00:28:35,980 --> 00:28:41,780
But this is what a sort of a more refined
workflow can look like when you're working

423
00:28:41,780 --> 00:28:43,559
through longer documents.

424
00:28:43,559 --> 00:28:45,270
And even longer than 14 pages.

425
00:28:45,270 --> 00:28:49,460
Many times, you might have documents that
are hundreds of pages, and so the time savings

426
00:28:49,460 --> 00:28:54,409
can really be multiplied over those various
pages.

427
00:28:54,409 --> 00:28:58,610
And then when you export a document like this,
(keeping in mind that I have not addressed

428
00:28:58,610 --> 00:29:04,070
every page, this is going to actually produce
some error messages for me) so this is where

429
00:29:04,070 --> 00:29:08,600
I just wanted to show you that, for example,
those missing images and as well as some other

430
00:29:08,600 --> 00:29:13,120
elements that I have not yet touched, these
are going to populate in this in this error

431
00:29:13,120 --> 00:29:14,470
list for me.

432
00:29:14,470 --> 00:29:18,890
And if I were actually remediating this document,
I would be able to go to the list of errors

433
00:29:18,890 --> 00:29:23,600
or warnings, and I'd be able to click on these
warning messages and go directly to the page

434
00:29:23,600 --> 00:29:27,390
where they're located, fix them and then regenerate
the PDF.

435
00:29:27,390 --> 00:29:31,230
Now I realize that we are just about out of
time, and I want to be respectful of everyone

436
00:29:31,230 --> 00:29:35,419
else's time, so I’m actually going to just
jump back to the slide deck here.

437
00:29:35,419 --> 00:29:39,880
And I will just say thank you again for everyone
for joining today.

438
00:29:39,880 --> 00:29:45,350
As I said before if you have any specific
questions or feedback, we would love to chat

439
00:29:45,350 --> 00:29:50,750
with you and talk about how Equidox can fit
into your PDF remediation workflow.

440
00:29:50,750 --> 00:29:56,480
We can do another more specific demonstration,
working through some of your documents if

441
00:29:56,480 --> 00:30:01,850
you'd like, and talk about how to define and
refine a workflow for yourself.

442
00:30:01,850 --> 00:30:04,880
For more information about how Equidox Software
Company can help you

443
00:30:04,880 --> 00:30:10,799
with PDF accessibility
Email us at EquidoxSales@equidox.co

444
00:30:10,799 --> 00:30:14,540
Or give us a call at
216-529-3030

445
00:30:14,540 --> 00:30:19,450
Or visit our website at www.equidox.co