How to Improve OCR Accuracy

Scan2CAD
21 Sept 202109:28

Summary

TLDRThe video script provides a detailed tutorial on using Scan2CAD's OCR functionality to convert raster images with text into editable vector text. It emphasizes the importance of high-quality, clear images for optimal OCR results and discusses the manual cleanup of small or touching text elements that may hinder recognition. The script guides viewers through setting OCR parameters, including character size and confidence levels, and offers tips for handling intersecting lines and vertical text. It concludes with manual editing techniques for perfecting the conversion and exporting the final vectorized document.

Takeaways

  • šŸ” Use high-quality, high-resolution images with minimal pixelation and blurriness for the best OCR results.
  • šŸ“š Ensure the text in the image is clear and easy to read for effective text recognition.
  • šŸ–‹ Handwritten or stylized text may not be recognized well by OCR, especially if it's not clear.
  • šŸ” Small text details, like holes in letters, can be lost, affecting OCR accuracy.
  • šŸ‘€ Scan2CAD's OCR functionality can convert raster text into editable vector text.
  • āœ‚ļø Manual editing may be necessary for text that is too close or has intersecting lines.
  • šŸ”¢ Set the maximum character size using 'Select from Image' for accurate OCR settings.
  • šŸ“ The minimum character size is usually calculated automatically but can be adjusted manually.
  • šŸ“ˆ Minimum confidence level determines the display of text objects based on their recognition certainty.
  • šŸ”„ Character rotation options should be used based on the presence of vertical or angular text.
  • šŸŒ Choose the appropriate language and document type for more accurate OCR results.
  • šŸ–¼ļø After conversion, manually adjust and clean up the text for optimal results.
  • šŸ–Šļø Use the 'Draw Text' tool to replace or complete text that wasn't converted properly.
  • šŸ–¼ļø The 'Highlight Vectors' feature helps in identifying and editing different vector elements.

Q & A

  • What is the main issue with converting raster text to vector lines using automatic conversion programs?

    -The main issue is that the text often ends up as vector polylines, which are not editable as true type vector text.

  • What does OCR stand for and what does it do in the context of Scan2CAD?

    -OCR stands for Optical Character Recognition. In Scan2CAD, it recognizes the text objects in a raster image and converts them into editable true type vector text.

  • What are the key factors to consider when choosing a document for automatic conversion?

    -The document should have good quality, high resolution, minimal pixelation, and blurriness, with clear and easy-to-read text.

  • Why might Scan2CAD struggle with converting handwritten text?

    -Handwritten text, especially if stylized, may not be recognized well by Scan2CAD due to variations in handwriting and potential pixelation issues.

  • What can be done if the text in the image is too small or lacks fine details?

    -For smaller text, you can manually erase parts that are touching or add details like holes in letters to make them more recognizable for Scan2CAD.

  • What is the purpose of setting the maximum character size in Scan2CAD's OCR functionality?

    -Setting the maximum character size helps Scan2CAD to identify and convert the largest characters in the image, which in turn assists in automatically calculating the minimum character size.

  • What does the minimum confidence level in OCR represent?

    -The minimum confidence level represents the certainty of Scan2CAD in converting text objects. Text objects below this level may not be displayed if they do not meet the set confidence threshold.

  • Why is it important to consider character rotation settings in OCR?

    -Character rotation settings are important to ensure that Scan2CAD can recognize and convert text that is not only horizontal but also vertical or at an angle.

  • What should be the default document type setting for technical drawings in Scan2CAD?

    -The default document type setting for technical drawings should be 'technical' to optimize the OCR conversion process.

  • How can you manually correct the converted text in Scan2CAD if it's not accurate?

    -You can use the 'Highlight Vectors' feature to see the converted vectors, then use the erase tool to remove inaccurate parts and the text tool to manually add or correct the text.

  • What is the recommended minimum confidence level setting for most conversions in Scan2CAD?

    -The recommended default minimum confidence level is 60, which should be used unless there is a specific reason to adjust it.

Outlines

00:00

šŸ“š Advanced OCR Functionality in Scan2CAD

This paragraph discusses the process of converting raster text to editable vector text using Scan2CAD's Optical Character Recognition (OCR) feature. It emphasizes the importance of high-quality, high-resolution images with clear, easily readable text for optimal OCR results. The script mentions challenges with stylized handwriting and small text details, such as holes in letters, that may not be recognized properly. It provides tips for cleaning up images, such as using the erase tool to separate touching letters and restore details. The paragraph also explains the OCR settings, including character size selection, minimum character size, confidence level, character rotation, and language options, to ensure the best conversion results.

05:08

šŸ–Œļø Post-OCR Image Cleanup and Finalization

The second paragraph focuses on the post-OCR conversion steps in Scan2CAD. It describes generating a preview to assess the OCR results, noting issues with letters that were too close together and not converting well. The script suggests adjusting the minimum confidence level setting and provides a caution against setting it too high, which could result in the disappearance of text objects. It also touches on the option to handle intersecting lines and the manual correction process, including using the 'Highlight Vectors' feature, erasing incorrect vector lines, and adding true type text. The paragraph concludes with the finalization of conversions, adjusting text placement, and exporting the cleaned-up file in the desired format, showcasing Scan2CAD's capability to convert technical images with text effectively.

Mindmap

Keywords

šŸ’”Automatic Conversion Programs

Automatic conversion programs refer to software tools that convert documents or images into different formats, often from raster to vector. In the context of the video, these programs are used to transform documents with raster text into editable vector text. The script mentions that using such programs can result in vector polylines instead of clean text, which is not always desirable.

šŸ’”Raster Text

Raster text is a type of text representation in images where the text is composed of pixels, similar to how photographs are made up. The video script discusses the challenges of converting raster text into vector text, emphasizing the need for high-quality images to ensure accurate conversion.

šŸ’”Vector Text

Vector text is text that is represented by mathematical formulas, allowing it to be scaled infinitely without losing quality. The script explains that the goal of using Scan2CAD's OCR functionality is to convert raster text into editable vector text, which is more desirable for editing and manipulation in technical drawings.

šŸ’”Optical Character Recognition (OCR)

OCR is a technology that enables the conversion of various types of documents, such as scanned paper documents, PDF files, or images captured by a digital camera, into editable and searchable data. In the video, OCR is highlighted as Scan2CAD's advanced feature for recognizing text objects in raster images and converting them into editable vector text.

šŸ’”High Resolution

High resolution refers to the quality of an image, where more pixels are packed within a certain area, resulting in a clearer and more detailed image. The script stresses the importance of using high-resolution images for conversion to minimize pixelation and blurriness, which can affect the OCR process.

šŸ’”Pixelation

Pixelation is the result of an image being composed of large, visible pixels, often occurring in low-resolution images. The video script mentions that minimal pixelation is desired for the OCR process to work effectively, as pixelation can obscure the details of the text and hinder accurate recognition.

šŸ’”Handwritten Text

Handwritten text refers to text that is written by a person, as opposed to printed or typed text. The script notes that handwritten text may not be well-suited for OCR conversion, especially if the handwriting is stylized or unclear, which can complicate the recognition process.

šŸ’”Minimum Confidence Level

The minimum confidence level is a measure of how sure the OCR software is about the accuracy of the text it has recognized. The script explains that Scan2CAD assigns a confidence level to each text object, and a higher level indicates a more accurate conversion. The user can set a minimum confidence level to filter out text that is below a certain threshold of certainty.

šŸ’”Character Rotation

Character rotation in the context of OCR refers to the ability of the software to recognize and correctly orient text that is not horizontal. The script mentions that Scan2CAD allows users to select whether to include recognition for horizontal, vertical, or angular text, which can be important for certain types of documents.

šŸ’”Vectorizing OCR

Vectorizing OCR is the process of using OCR technology to convert raster images of text into vector format. The script describes the steps to initiate this process in Scan2CAD, including setting the maximum character size and using the OCR tab to configure settings for optimal text recognition and conversion.

šŸ’”Intersecting Lines

Intersecting lines refer to lines in a drawing or image that cross each other. The script mentions an option in Scan2CAD for handling intersecting lines, which can be useful in niche situations where text or other elements are obscured or incorrectly recognized due to lines crossing over them.

Highlights

Automatic conversion programs often create vector polylines from raster text, which is not always desirable.

Scan2CAD has advanced OCR functionality to convert raster text into editable vector text.

High-resolution, clear images with minimal pixelation and blurriness are recommended for conversion.

Handwritten, stylized text may not work well with OCR due to recognition challenges.

Small text details like holes in letters can be lost, affecting OCR accuracy.

Letters touching each other may be recognized as a single pixel cluster, not individual text.

Manually editing the image to separate touching letters can improve OCR results.

Clicking 'Convert Raster Image' and selecting 'Vectorizing OCR' initiates the OCR process.

Setting the maximum character size using 'Select from Image' helps in accurate OCR.

Minimum character size is automatically calculated or can be set manually.

Minimum confidence level determines the display of text objects based on OCR certainty.

High confidence levels are assigned to clear, minimally pixelated text.

Adjusting the minimum confidence level can affect the visibility of converted text.

Character rotation settings can be customized for horizontal, vertical, or angular text.

Language and document type selections are crucial for accurate OCR results.

Running the OCR generates a preview of the converted text.

Post-OCR, manual adjustments can be made to clean up the converted text.

Using 'Highlight Vectors' helps in identifying and correcting converted vector objects.

Finalizing conversions and exporting the file completes the OCR process.

Scan2CAD's OCR can effectively convert various technical images with text.

Transcripts

play00:04

[music]

play00:04

Usually, when you use automaticĀ  conversion programs to convertĀ Ā 

play00:08

documents like this one that have raster text,Ā  you're gonna end up with a lot of vector lines,Ā Ā 

play00:14

and even the text ends up as vector polylines.Ā  And sometimes you don't want that, you want toĀ Ā 

play00:19

convert them into actual editable two-typeĀ  vector text. Luckily enough Scan2CAD hasĀ Ā 

play00:28

really advanced OCR functionality, which justĀ  means Optical Character Recognition, which shouldĀ Ā 

play00:34

recognize the text objects in this raster imageĀ  and convert them into editable two-type text.Ā Ā 

play00:42

Now with any file that you want to convertĀ  automatically, you're gonna want to get the fileĀ Ā 

play00:47

that has good quality, high resolution, minimalĀ  pixelation, and blurriness. As for the text in theĀ Ā 

play00:57

image, you're gonna want to make sure thatĀ  they're really clear and easy to read.Ā Ā 

play01:02

Sometimes, you're gonna have aĀ  document with handwritten text.Ā Ā 

play01:05

And that doesn't really work well, especially ifĀ  the handwriting is a little bit more stylized.

play01:12

For the smaller text, you can seeĀ  that some of them don't really haveĀ Ā 

play01:18

fine details, the holes in the A's are erased,Ā  for example. And some of the left letters areĀ Ā 

play01:24

a bit too close to each other so much soĀ  that the pixels are actually touching,Ā Ā 

play01:30

Scan2CAD might not recognize this very readily.Ā  If these letters are touching, Scan2CAD recognizesĀ Ā 

play01:41

as one single pixel cluster, so they won'tĀ  be recognized as individual text. If you canĀ Ā 

play01:49

get images that don't have these types of texts,Ā  or you can manually erase the parts that areĀ Ā 

play01:55

touching by clicking on the erase tool here, andĀ  just going over and making gaps between these twoĀ Ā 

play02:06

letters. You can even put some holes in the A'sĀ  here. In any case, if you have a better qualityĀ Ā 

play02:16

document, you shouldn't beĀ  having those problems here.Ā Ā 

play02:20

Once you've had some time to clean up the image,Ā  you can click on "Convert Raster Image". Click onĀ Ā 

play02:29

"Vectorizing OCR". Once you click that, there'sĀ  an OCR tab that pops up here, click on that.

play02:35

You wanna set the maximum character size, you canĀ  click on "Select from Image", look for the largestĀ Ā 

play02:41

characters here, which I think might be theĀ  room label. So let's zoom in on any one of them.Ā Ā 

play02:47

Click and drag from the top to theĀ  bottom or from the bottom to the top.Ā Ā 

play02:53

Once you let go, it automaticallyĀ  sets the maximum character size here.Ā Ā 

play02:58

And by default, the minimum character sizeĀ  will also be calculated automatically from theĀ Ā 

play03:03

maximum character size. You can set it upĀ  manually by un-ticking this one and justĀ Ā 

play03:10

typing it in. But usually, you're gonna wannaĀ  keep that ticked to get the best results.

play03:20

Another thing I want to talk aboutĀ  is the minimum confidence level.Ā Ā 

play03:24

Basically based on the quality of theĀ  image, based on the pixelation, based on theĀ Ā 

play03:28

quality of the individual raster text, Scan2CADĀ  assigns all of these text objects a certainĀ Ā 

play03:36

confidence level in percent. If it's sure that itĀ  converted it properly, because it's a clear word,Ā Ā 

play03:42

and has minimal pixelation, it has very clearĀ  letters, then it's gonna assign it a highĀ Ā 

play03:49

confidence level. If it's a little bit moreĀ  blurry, if it's not super clear, it's gonnaĀ Ā 

play03:54

assign it a lower confidence level. This minimumĀ  percentage here just makes it so that any textĀ Ā 

play04:02

object that's below this minimum confidence levelĀ  won't be displayed. The default value is 60,Ā Ā 

play04:11

if you set it to 90 for example, onceĀ  you convert the file, a lot of theseĀ Ā 

play04:18

text objects might start to disappear becauseĀ  they don't reach the minimum confidence level thatĀ Ā 

play04:25

you set here. For character rotation, by default,Ā  horizontal is ticked and vertical angular are not.Ā Ā 

play04:34

If you have any vertical or angularĀ  text here, then you might as wellĀ Ā 

play04:37

take them but as much as possible leave themĀ  unticked if you don't need to use them anyway.

play04:43

Because if you leave them ticked, itĀ  might create some false positives.Ā Ā 

play04:47

As for the language, you can choose between allĀ  of the languages that we have here. By default isĀ Ā 

play04:53

going to be English, and for the document typeĀ  and technical drawings such as this one shouldĀ Ā 

play04:58

be set the technical and if you have a mostlyĀ  text-based raster file, then click on text here.Ā Ā 

play05:08

With all these settings, set the defaultĀ  and just put the maximum character sizeĀ Ā 

play05:11

ready here, then click on "Run" toĀ  generate a preview. See what we get.Ā Ā 

play05:20

It's pretty clean. But like I expected,Ā  some of these letters that were tooĀ Ā 

play05:28

close to each other didn't convertĀ  very well. But as you can see,Ā Ā 

play05:33

the word here that we took the time to clean up,Ā  which is arched, converted with no problem. Again,Ā Ā 

play05:41

it's just a matter of cleaningĀ  up the images as much as you can.Ā Ā 

play05:45

Like I mentioned before, if you set this toĀ  let's say the highest possible value of 99,Ā Ā 

play05:52

Click on "Run", you're gonna see aĀ  lot of these words start to disappear.Ā Ā 

play05:58

Like so, 'cause nothing can go higher thanĀ  99. So stick to the default level at 60.

play06:11

Also, we have an option I forgot to mentionĀ  that has intersecting lines. I think it'sĀ Ā 

play06:15

just for these niche situations where someĀ  of these others are kind of intersecting.Ā Ā 

play06:23

Again, this isn't the most ideal placementĀ  for a letter, so this should help,Ā Ā 

play06:30

but I think for this particularĀ  one, it's just too...Ā Ā 

play06:36

Well, for one, the line that's intersecting isĀ  too thick and it's intersecting where there areĀ Ā 

play06:41

two points. So this might not be recognized,Ā  but let's see if it helps. I think for certainĀ Ā 

play06:50

letters that are more legible, it should help.Ā  Yeah, no, it didn't really help that much,Ā Ā 

play06:58

but I think for these other ones, it kind of did.Ā  Okay. Let's click on "Okay" to finalize all ofĀ Ā 

play07:03

these conversions. Now, if you want to manuallyĀ  change this, you can go to the both tab here,Ā Ā 

play07:12

click on "Highlight Vectors" to see theĀ  converted vectors on top of the original raster.

play07:19

If you wanna turn this into a rasterĀ  object, rather no, a true type object,Ā Ā 

play07:25

you can click on the erase tool, erase theĀ  lines that were converted, click on theĀ Ā 

play07:31

text tool. Actually, I wanna check how largeĀ  these characters are. So I'm gonna click on one,Ā Ā 

play07:38

vector information says that theĀ  size is 20. So I'm going to click onĀ Ā 

play07:42

"Draw Text", click here, set the size toĀ  20, and then just type in the word down.Ā Ā 

play07:54

Let's click on the arrow tool and just dragĀ  this to its proper placement over here.Ā Ā 

play08:02

What else do we wanna change? Okay,Ā  for example, we have this one whereĀ Ā 

play08:06

part of the word was converted, butĀ  the rest wasn't. You can actually justĀ Ā 

play08:11

erase the part of the word that wasn't convertedĀ  back to the both tab to see the reference.Ā Ā 

play08:18

Click on the arrow tool, click on theĀ  R, and then add the rest of the word.Ā Ā 

play08:27

Click on okay. Move it a little bitĀ  'cause it's intersecting the next word,Ā Ā 

play08:32

so just move it over here like this. You canĀ  take your time and clean up the whole image.

play08:40

Also, if you click on the "Highlight Vectors"Ā  button here, it just makes all of theĀ Ā 

play08:45

unique vector line types into different colors.Ā  So, polylines are red, arcs are magenta,Ā Ā 

play08:52

text objects are also magenta, as you can see,Ā  dashes are black, and if you're happy with this,Ā Ā 

play09:00

un-tick the "Highlight Vectors" button, andĀ  you can click on export here in the upperĀ Ā 

play09:04

right to just save the file into whatever fileĀ  format it is that you wanna use. With the OCRĀ Ā 

play09:10

functionality of Scan2CAD, you can convertĀ  many various types of technical images thatĀ Ā 

play09:16

have text on them with no problem, you don'tĀ  have to worry about the polythyne text anymore.

Rate This
ā˜…
ā˜…
ā˜…
ā˜…
ā˜…

5.0 / 5 (0 votes)

Related Tags
OCR ConversionVector TextRaster ImageImage EditingTechnical DrawingsText RecognitionScan2CADImage QualityEditable ContentManual Cleanup