Equidox AI Demonstration

Watch our cutting-edge fully automated PDF remediation solution in action!

Video transcript

[Pat Needles] Hi everybody, my name is Pat Needles. I am a Vice President here at the Equidox Software Company. I first wanted to thank each and every one of you for making yourself available today to join us for what promises to be a very exciting new technology that Equidox has developed, and we're super excited to show you how it works and the value that is associated with it. You can see on this first slide, up top, our mission statement, which is “enabling PDF accessibility through intelligent automated solutions,” and that's a common theme you'll see throughout today's presentation and discussion about the genesis of Equidox: where we started, where we've been, and where we're at today. We're excited to talk to everyone, especially the folks who have never heard of Equidox. This might be your first time at one of our webinars, so welcome to you, and we're looking forward to showing you what we have today. (You can move forward, Dan.) So what we're going to talk about today, as far as the agenda is concerned, is: who is Equidox, where did we come from, and where are we going? As I mentioned before, there are some super exciting things that we're going to talk about. Why make PDFs accessible? I think most people on this call understand a lot of the reasons why you might want to do that. We're also going to talk today about our AI-powered PDF accessibility tool, how it works, and the value of it. And of course we'll get into the most important thing here today, which is the actual demo of the AI solution. So who is Equidox and where did we come from? The genesis of Equidox starts in about 2011, when we were asked by the Canadian federal government to build a software tool that would make PDFs accessible for people with disabilities. When we started this process and began building the tool, our first output was PDF to HTML.
Soon after that we grew to an accessible PDF output, and from there we have built a lot of different types of automation into the tool that we will definitely be talking about. Over, say, the last three years, we have built some detection tools into our software application that have made PDF remediation much simpler and more efficient. Any of you folks on this call today who have done remediation understand that this is a very painful process. It can be time-consuming, and we've taken a lot of that heavy lifting off of the table by building tools into the software application that allow for fast and easy remediation. And then secondly, what we're going to be showing you today is our AI-powered software solution for high-volume, templated, recurring documents at scale. (Okay, Dan.) So, you know, we talk about the challenges of PDF remediation and how difficult that can be. It's very labor intensive. Folks need accessibility skills in order to remediate documents properly. It can be very expensive if you are outsourcing these types of activities to third parties. There is the possibility of human error while remediating documents on challenging timelines. (Go ahead, Dan.) So the Equidox AI solution, as I mentioned earlier, is a fully automated PDF solution that really takes the human element out of the actual remediation. What we're going to show you today is how we import documents into our system, how they run through our AI engine, and how they come out completely tagged and completely accessible. There is no need, as I mentioned, for any human intervention with this tool. And we are really excited to show you how this works here today. So with that, I'll go ahead and turn it over to you, Dan, to do the demonstration of the tool.
[Dan Tuleta] Okay great, thanks, Pat.
So before we get into the actual demonstration of Equidox AI, I’m just going to set the table a little bit more, just in case anyone is new to accessibility. I'm going to talk through a little bit of the process and how it works, and also why it's important to do this, just so everyone's aware. Equidox, in cooperation with the NFB, conducted a survey of over 250 blind and low-vision assistive technology users. That would primarily be people who are using screen reading technology in order to read digital content on their computer screen. And we found, after surveying these 250 blind and low-vision users, that at least two-thirds of the PDFs that they interact with on a regular basis are not accessible. So they cannot interact with those documents if they were to use their assistive technology. Adobe, who owns the PDF format, estimates that there are over three trillion PDFs in circulation today. So that means there are about two trillion completely inaccessible documents out there, floating around in circulation. That is a staggering number of documents, and many people who aren't blind or low vision and don't use assistive technology are probably interacting with PDFs on a regular basis and don't even really think about it. But those PDFs present a huge barrier to understanding information for people who rely on their assistive technology to interact with the content. So, the main takeaway from this slide here: many people on this call have probably heard the acronym WCAG, which stands for the Web Content Accessibility Guidelines. Maybe you've heard of the Section 508 standards.
We're not going to get into the alphabet soup and start breaking down the differences and the similarities between these different standards and guidelines, but the main takeaway from this is that these are the standards that are being used to measure the accessibility of websites and documents and digital content. The number of lawsuits that we're seeing every single year is on a steady rise, so the lawsuits are coming in fast and furious around websites and documents and digital accessibility as we move to a more and more digital world. So organizations have to be aware that they are not exempt from making sure that their content is accessible, and you have a responsibility to make sure that you're providing digital content in a way that can be used by everyone. So with that said, as Pat alluded to a few slides back, the challenge with PDF accessibility for many organizations is just the sheer volume of documents that they produce and distribute. The number of pages can make it extremely difficult to manage on a consistent basis. Outsourcing this work is extremely expensive. You also lose control of the deadlines, since you depend on that outsourced company getting those documents back to you in an efficient manner and meeting your deadlines. You can have concerns over the security and the privacy of these documents if you have anything that needs to remain confidential, and you're also putting your faith in these outsourcing firms that they are going to be delivering high-quality work, which they may not be. Remediating these documents internally has a lot of the same challenges for these organizations, but mainly you have to dedicate very large numbers of staff to be able to manage the volume of documents, and it becomes a big killer of time for these organizations and for the employees who are having to do the actual remediation work.
So what we have done is we have introduced Equidox AI, which can fully automate the process of making these documents accessible using artificial intelligence, a combination of machine learning and computer vision. We are essentially applying training to these artificially intelligent models to teach them what these documents look like and the different structures that make them up, and then they can fully automate the process to ensure that the documents are accessible and are delivered to the customer without any issues when used with screen reading technology. So Equidox AI all starts with human programming. I just want to make it clear that Equidox AI is not “auto-tagging.” There are many people who will throw around the term “auto-tag,” meaning that it's automatically creating tags for the PDF, and the tags are what are being used by screen readers and assistive technology to interact with the content. Equidox, although it is applying tags in an automated way, is not auto-tagging. Auto-tagging requires basically hard coding to teach an auto-tagger that this type of font should be a certain type of tag. It is very inaccurate, it's very unreliable, and it will not produce compliant or usable documents. So what we do with Equidox is we start with human work being done up front. We train the AI models based on the structures of that specific PDF template, whatever that template may look like. It could be a bank statement, it could be a utility bill, it could be an explanation of benefits on an insurance form… So we have a combination of our data scientists who are doing the actual AI training, we have our software developers who are implementing this technology, and we of course have accessibility experts on staff who can assist in that training of the AI models because they know exactly how to tag a PDF document. So just to go through a little bit of the process… Equidox AI is implemented postprocessing.
And what I mean by that is you do not have to completely uninstall and recreate a process for building your PDF documents. Many organizations already have a very well-established process in place to create and deliver these documents to their customers. We are placing ourselves into that workflow seamlessly, without having to completely rebuild that process from the ground up, which could be a very time-consuming and costly endeavor. The first step with Equidox AI is that you would provide sample documents to us in order for us to analyze the documents. Then we would train the AI models based on how we analyze those documents. So we teach the AI models all of the different elements that are accounted for in your template. We apply that training and then we would integrate the workflow, so we would figure out how we want to deliver these documents to the customer. Is this something where they are downloading a PDF from one of your portals? Maybe it's their bank statement, for example. Or is this something where you would like to manage it internally, where perhaps you have a bulk import of documents every single month that need to be made accessible before they get posted online? Once that decision is made as to how we're going to integrate the workflow, the process would begin where Equidox would apply the tag structure to these documents. The documents are then exported from the system and delivered through whatever mechanism we've determined, whether it's directly to a specific customer of yours, or to some sort of data repository so that these documents can be uploaded to your internet or website or wherever they need to go. So, just a quick rundown of how we analyze PDF documents. We don't need to get too technical with this, but basically these colored dots that we're seeing on the screen here represent the AI's analysis of different variations of documents.
So they are grouped together in different areas by the AI based on different anchors that it's seeing. For example, these documents all contain pie charts, so that is what would group them together in a specific cluster, and it would trigger the AI to look for certain variables and certain elements on this PDF that are consistent with the training that we have provided. Here's another example where you can see that a number of documents, in this case, are sorted into different segments based on the number of columns. So you have certain pages that might have two-column layouts. Other pages might have three or four columns, or a single-column layout per page. So it's always based on the template and the training that we provide from that initial analysis of the documents that you've given to us, so that we can teach the model exactly how to tag the various elements. One final thing to call out is just the ability to even tag tables. An easy example to think of would be your credit card bill. It's a very static document: the same document template can be provided to hundreds of thousands, maybe even millions, of customers of a credit card company. And that document is being produced on a monthly basis. Certain customers of that credit card company might have a single charge for the month on that credit card, and their statement might be a single page with a single line item for a single charge in that table. Another customer might have 500 charges on that credit card for the month, and that statement, although it comes from the same template, will be very different. It might be 10 pages long and have page after page of tables. So we are able to actually find these variances and we can account for them to make sure that everything is being tagged accurately.
So it's not just a static layout on every page. There can be variables across these templates, and we're able to train the AI models to identify these variables and account for them accordingly. So in the training of the AI model, just so everyone is aware, we do define the different elements. If you've never tagged a PDF document before, PDF documents will contain primarily text, but there has to be a structure applied to the text on the page. For example, there are headings that are used for navigation purposes. Tables have specific layouts that need to be tagged in a certain way so that assistive technology can interact with them. There could be lists. The reading order is a very important aspect as well. If a page has three columns, a screen reader needs to be told to read down the left-hand column first, then the middle column, then the right-hand column. If it's not told how to do that, the screen reader could easily just read clear across the top line of all three columns, rendering the entire page completely useless. So these are the types of training that we put into the models to teach them all of these specific variables for the specific template. And then of course we have to test the model. So this is not just a one-time thing where we teach it what this document looks like and we just hope for the best. We of course test these models and continue to train them throughout. Our goal is always both to pass the automated checkers and to make sure that these documents are fully usable for the end user who's going to be interacting with them. So we do apply manual checks as well, using screen readers to replicate how a blind user might be interacting with that document. These are very, very important aspects of PDF accessibility, not just auto-tagging and hoping that it passes an automated checker.
It really only matters if the end user that's receiving the document is able to interact with it, and that's what matters most to us. And also, I alluded to this earlier, but there are different methods of deploying Equidox AI technology. We do have an interface that we have built, which I will be demonstrating here in just a moment. So we've built an interface through which a customer would be able to run this technology themselves if they prefer. We can also embed this technology through APIs. So like I said before, we can make sure that this is fitting into your existing workflow and you're able to apply the tags and deliver an accessible document in a fully seamless transition where no human is required at all. And then we can also operate this as a managed service. So we can run the entire process and technology for you and deliver those documents to you, depending on the use case, depending on security requirements, anything. Every customer may have some different requirements, and we're here to work with you and figure out the best method of delivering these documents in an accessible way. Okay, so what I'm going to do now is transition out of this slide deck and we will get into the actual demonstration of Equidox AI. So if I just go to this other tab I have here in my browser, this is the interface that we have built just to demonstrate the technology and make it easier to understand what's actually happening. Right now I have a completely blank slate. I have nothing really to work with, as we can see, so what I'm going to do first is go to the Upload Documents tab. When I arrive at this tab, I can then just open up the folders on my hard drive, and in this case I'll just drag in a .zip file that contains some bank statements that we have scrubbed. These are just sample bank statements, and we will upload that .zip file containing the financial statements.
So once these documents upload, I can then go to the Run Batch tab here on the left-hand side. I will then choose the model that I want to apply. In this case, here we've only got three models currently uploaded to my little interface here for demo purposes, so what I'm going to do is I'm going to select the statement model. I will then choose the .zip radio button here and I will select that .zip folder that I have just uploaded just a second ago. Once I've selected the .zip file and I've selected my appropriate model, I just hit Run Batch and this will kick off a process where Equidox is currently ingesting the documents. It is looking at them kind of page by page, one by one and it is taking all of that model training that we have done ahead of time and it is automatically tagging these PDFs. So we'll see these green lights start to light up here as we work through the process. So you can see one of them is already complete, and we will see more of these green lights kind of lighting up in just a few seconds. Now keep in mind that this is just a demo environment of this technology. Equidox AI can be scaled to meet much higher demands and throughput. Of course, certain organizations might require literally hundreds of thousands of documents every single day to be remediated. We can just dedicate additional resources to it in order to power it to meet those types of demands for speed. While these documents are going through the process, I'm just going to open up one of them here just so that you can see what these documents look like. So if I open up like randomly statement number four here, what I can see is that this is a sort of generic-looking bank statement. We have images up here, we have text, we have headings, we have tables. But this document is completely untagged in its current form. If a screen reader was to interact with this document it would tell them that this is a blank PDF. So they would not be able to interact with this. 
They would have no idea what their monthly transactions were. They wouldn't know what their balance due was. Obviously, that's a huge problem because some people really need to make sure that they are being charged appropriately for all of their transactions for the month, and they, of course, need to know how much they need to pay for their credit card bill. So we need to make sure that these documents are being tagged and delivered in an accessible way. So with that said, while I was talking there, the whole process finished. I can see the status is now done. So all of those documents finished processing. I can look at the individual PDF files. So if I were to, let's say, just download one of them, I will download this and save it to my desktop. And if I open this up in Adobe Acrobat, I will see that this version of the document is now fully tagged. If you're familiar with tag structures, you know how complicated it is to manually set up these types of tag trees in Adobe Acrobat. If you're doing it manually, it's a very time-consuming process. It requires a lot of technical expertise, and even if you are the best Adobe Acrobat user in the world, it is still very, very time-consuming. So at the scale of things like statements, where there could be literally hundreds of thousands of them produced every single day by a single organization, it is completely unsustainable to expect human remediators to go through the process of tagging these elements manually. But now if I look at this version of the PDF, the logo has been described. We've trained it to apply this company logo alt description to the logo whenever it sees this particular image. We have text identified, so the entire document is tagged. As we can see here, we have text identified, so the bank name and the customer name. We have our figure tag, which is just the image, and we have our heading level one.
As we tab through we can see we have an H2 for the account summary, and the table which is nested below it has been properly tagged. You can see all of the rows of the table are properly tagged. We jump down to another heading level two for the deposits and credits, another table that is properly tagged, another heading, another table… All of these different elements have been accurately accounted for because of the initial training that we have put into this document template. So at the scale of a company that is mass producing these types of documents, we can now fully automate that process so none of your staff has to spend countless hours of their week remediating PDFs. This type of process can be applied to other templates. So it's not just for bank statements. It could be explanations of benefits, it could be bills, like utility bills or medical bills, or medical statements and reports and test results. There's an endless number of use cases for Equidox AI. It really just comes down to the customer and the types of documents that they're producing. But we would certainly love to chat with you based on your specific use case and your specific template to see if Equidox AI would be a good fit for your organization. So, closing out of this document now, I can also download all of these documents in a batch as well. If I go to my Batches and Jobs tab here, this is the batch that I just ran. So I would be able to extract all of these documents in a .zip file as well. I'm able to download them as a .zip just like I delivered it to Equidox. But keep in mind you don't always have to run this process through this interface. We can run this process for you as a managed service, or the technology that's going on behind the scenes can be seamlessly placed within your existing workflow so that we are not disrupting a longstanding process that you have for generating and delivering these PDF documents. Okay, so just as a quick summary.
Equidox is a leader in PDF remediation. This is all that Equidox does. We are entirely focused on PDF remediation. So we can solve your PDF remediation challenges whether it's a large or a small project. Many of you are here and this may be your first introduction to Equidox. Keep in mind that that is only part of what we offer. The Equidox AI solution is our newest technology. But we also have Equidox Software, which is meant for more unique, one-of-a-kind, ad hoc documents. There are many organizations that are producing, you know, marketing flyers or just documents that are meant for one-off use. They might not be producing them en masse. Equidox Software is designed to dramatically speed up and make more efficient the process of going through and tagging those documents that have completely unique elements, where we can't necessarily train artificial intelligence to fully automate that process. We have tools built into that Equidox technology that are powered by artificial intelligence to speed things up, but there are certain examples of documents where you do need some sort of human intervention to analyze the PDF for context and be able to properly remediate it. So if you're interested in those types of use cases, you can also feel free to reach out to us after the fact. We would love to talk to you about either or both of the use cases that you have in mind for your organization. And of course, we did just see the Equidox AI demonstration there, so you have a general idea of how that technology works. We will be sharing this slide deck out to everyone on the call after the fact. There are some relevant articles here if you are new to PDF accessibility or digital accessibility. If anything presented today was confusing, we will certainly give you some resources and relevant articles, and we do have a really large library of information on our website as well.
So feel free to interact with these articles and check out some of our past webinars if you're interested in seeing our other technology. But also please feel free to reach out to us for a more personalized demonstration and discussion around your use cases. So with that said, I'm going to pass it back over to Pat. We just have a minute or so left to wrap things up, but thank you everyone for attending today.
[Pat Needles] Thank you, Dan. Great job, great presentation. I hope that all of the folks here on the call today saw the great value in the Equidox AI tool. We'd love to hear from anybody who feels as though there might be a proper use case not only for our AI solution but, as Dan said, for our ad hoc PDF remediation software tool. You can reach us at EquidoxSales@equidox.co. Of course, you can always reach out to us by phone, you see the number there, or if you would just like general information, feel free to visit us at www.Equidox.co. And that concludes today's presentation.
For more information about how Equidox Software Company can help you with PDF accessibility, email us at EquidoxSales@Equidox.co, give us a call at 216-529-3030, or visit our website at www.equidox.co.

Webinar: Equidox AI Demonstration

Are you curious about automated PDF accessibility? Now you can witness Equidox AI in action!

In this free 30-minute webinar, we’ll walk you through the process and offer a live demonstration using a variety of sample documents.

During the webinar, we’ll show the before-and-after results of Equidox AI’s automated remediation process, where our AI-powered models generate accurate tag trees at the click of a button.

Don’t miss the opportunity to witness this cutting-edge technology in action!
