PlotDigitizer - How to Automatically Extract Data from Graph Image (#3)

PlotDigitizer
21 Apr 202107:52

Summary

TLDRThis video tutorial introduces the process of automatically extracting data from a graph image using PlotDigitizer. It emphasizes the efficiency of automation over manual extraction and outlines the three-step autotracing method involving color selection, mask application, and algorithm selection. The importance of calibration, the use of different mask types, and the need for minor adjustments due to the algorithms being in beta phase are also highlighted. The tutorial guides users through extracting red colored scattered points and curves, showcasing the software's capabilities and directing them to the official documentation for further information.

Takeaways

  • πŸ“ˆ Start by uploading the graph image to PlotDigitizer.com for data extraction.
  • 🎨 Calibrate the graph by setting X1, X2, Y1, and Y2 markers to known coordinates.
  • πŸ–ŒοΈ Choose the color of the data points or curves you wish to extract using the color picker tool.
  • 🎭 Apply a mask to the area of interest; options include pen mask and box mask for precise selection.
  • ✏️ Use the eraser mask to remove unwanted portions, such as overlapping or interfering elements.
  • πŸ” Select the appropriate algorithm for the type of data extraction; point, curve, or histogram algorithms are available.
  • πŸ”„ Algorithms are in beta, so minor adjustments may be necessary for accurate data extraction.
  • πŸ”„ For overlapping points, manual corrections can be made to ensure data accuracy.
  • πŸ“Š View extracted data in the dataset table and save it for further analysis or processing.
  • πŸ”„ Repeat the process for different colors or types of data within the same graph.
  • πŸ“ˆ Adjust point density for curves using the provided slider for a clearer representation.

Q & A

  • What is the main topic of the video tutorial?

    -The main topic of the video tutorial is how to automatically extract data from a graph image using PlotDigitizer.com.

  • Why is watching the earlier videos in the series important before starting this tutorial?

    -Watching the earlier videos is important because they provide foundational knowledge and context necessary for understanding the more advanced concepts taught in this video.

  • What is the first step in the automatic data extraction process?

    -The first step in the automatic data extraction process is to calibrate the graph by moving the calibration markers X1 and X2, and Y1 and Y2 to known X and Y coordinates.

  • How does one calibrate a graph in PlotDigitizer?

    -To calibrate a graph in PlotDigitizer, you use the zoom panel to drag and drop the calibration markers to the respective known X and Y coordinates on the graph.

  • What are the different types of masks available for autotracing in PlotDigitizer?

    -The different types of masks available for autotracing in PlotDigitizer are the pen mask and the box mask.

  • What is the purpose of the pen mask in autotracing?

    -The pen mask is used for extracting curves and provides more flexibility as you can control its thickness, making it ideal for precise tracing.

  • How can the eraser mask be utilized in the autotracing process?

    -The eraser mask is used to remove or eliminate unwanted portions from the masked region, such as noises or interfering objects with the same color as the object of interest.

  • What is the recommended algorithm for extracting points in PlotDigitizer?

    -For extracting points, the point algorithm is recommended in PlotDigitizer.

  • What should one do if the algorithms detect overlapping points as a single point?

    -If the algorithms detect overlapping points as a single point, manual corrections can be made to fix such minor issues.

  • How can one view the extracted data?

    -The extracted data can be viewed in the dataset table on the PlotDigitizer interface.

  • What is the process for extracting data from both curves and scattered points in the graph?

    -The process involves calibrating the graph, selecting the desired color for extraction, applying the appropriate mask, choosing the correct algorithm, and making manual adjustments if necessary. This is repeated for each type of data to be extracted.

Outlines

00:00

πŸ“Š Introduction to Automatic Data Extraction with PlotDigitizer

This paragraph introduces viewers to the video tutorial on PlotDigitizer.com, which focuses on the automatic extraction of data from graph images. It emphasizes the time-saving benefits of automation over manual data extraction and encourages viewers to watch previous tutorials for context. The video then proceeds to demonstrate the upload and calibration of a graph image, highlighting the importance of accurate calibration. The paragraph outlines the three-step process of autotracing: color picking, mask application, and algorithm selection. It also explains the use of the zoom panel for precision and the need for careful color picking to avoid errors in autotracing.

05:04

πŸ› οΈ Selecting and Applying Algorithms for Data Extraction

In this paragraph, the tutorial delves into the various algorithms available in PlotDigitizer for data extraction, such as cluster, points, curves, and histogram algorithms. It directs viewers to the official documentation for more information on these algorithms. The video then demonstrates the selection of the point algorithm for extracting points and acknowledges that the algorithms are in beta, which may require minor adjustments. The paragraph also addresses the issue of overlapping points being detected as one and suggests manual corrections. Finally, it mentions the ability to adjust point density for curves and teases that other algorithms will be discussed in subsequent videos.

Mindmap

The video is abnormal, and we are working hard to fix it.
Please replace the link and try again.

Keywords

πŸ’‘PlotDigitizer

PlotDigitizer is a software tool featured in the video that enables users to extract data from graph images automatically. It is the central theme of the tutorial, as the entire video is dedicated to demonstrating how to use this tool effectively. The script outlines various functionalities of PlotDigitizer, such as calibration, color picking, mask application, and algorithm selection for data extraction.

πŸ’‘Data Extraction

Data extraction is the process of obtaining raw data from various sources and converting it into a format that can be easily analyzed or used for further processing. In the context of the video, data extraction refers to the act of pulling numerical data from graph images, which can be done manually or, as shown, automatically using PlotDigitizer.

πŸ’‘Calibration

Calibration in the context of the video refers to the process of adjusting the scale and axes of the graph image in PlotDigitizer to match known values. This step is crucial for ensuring the accuracy of the data extracted from the graph image, as it sets the reference points for the X and Y coordinates.

πŸ’‘Color Picker

The color picker is a tool within PlotDigitizer that allows users to select a specific color from the graph image for autotracing. This feature is essential for the software to identify and extract data points or curves of a particular color, ensuring that the correct data is extracted.

πŸ’‘Mask

A mask in PlotDigitizer is a tool used to define the region of interest for data extraction. It can be applied to isolate specific data points or curves, allowing the user to focus on the relevant parts of the graph image. The script mentions two types of masks: pen mask and box mask, each serving a different purpose based on the shape and distribution of the data.

πŸ’‘Algorithm

In the context of the video, an algorithm refers to the set of rules or procedures used by PlotDigitizer to process and extract data from the graph image. Different algorithms are designed for different types of data, such as points, curves, histograms, and more. The choice of algorithm affects the accuracy and efficiency of the data extraction process.

πŸ’‘Autotracing

Autotracing is a feature in PlotDigitizer that automates the process of identifying and extracting data points or curves from a graph image based on the user's selected color and mask settings. It streamlines the data extraction process, saving time and effort compared to manual extraction methods.

πŸ’‘Dataset Table

The dataset table is an interface within PlotDigitizer that displays the extracted data points or values in a structured format. It allows users to review, edit, and save the extracted data for further analysis or use.

πŸ’‘Point Density

Point density refers to the number of data points or the level of detail in the representation of a curve or graph. In PlotDigitizer, adjusting the point density allows users to control the smoothness and precision of the extracted curves, which can be important for accurate data analysis.

πŸ’‘Eraser Mask

The eraser mask is a tool in PlotDigitizer used to remove or eliminate unwanted portions from the masked region during the data extraction process. It helps in cleaning up the data by erasing noise or interfering objects that may have been included in the initial mask.

πŸ’‘Algorithms (Cluster, Points, Curves, Histogram)

The video mentions various algorithms available in PlotDigitizer, each designed for extracting different types of data. Cluster, points, curves, and histogram algorithms cater to specific data extraction needs, allowing users to choose the most appropriate method for their analysis. The algorithms are in beta phase, which means they may require minor adjustments for optimal results.

Highlights

Introduction to PlotDigitizer's video tutorial series on automatic data extraction from graph images.

Manual data extraction is tedious; automation saves time and effort.

Uploading the graph image is the first step in the data extraction process.

Calibration of the graph is necessary, using known X and Y coordinates for markers X1, X2, Y1, and Y2.

The coordinates of the cursor are displayed after calibration, aiding in precise data extraction.

Autotracing, or automatic extraction, is divided into three simple steps for ease of use.

The first step in autotracing is selecting the color of the data points to be extracted using the color picker.

Masking is the second step, where you apply either a pen mask or a box mask to the area from which data is to be extracted.

The pen mask offers flexibility for extracting curves, while the box mask is suitable for large, scattered objects.

The eraser mask is used to remove unwanted portions such as noises or interfering objects from the masked area.

Algorithm selection is the final step, with options like cluster, points, curves, and histogram.

PlotDigitizer's algorithms are in beta, and minor adjustments may be needed for accurate data extraction.

Extracted data points can overlap, and manual correction may be necessary for precise results.

All extracted data is displayed on a dataset table for review and saving.

Adjusting point density is possible for curves to refine the extracted data.

The tutorial series will cover additional algorithms in upcoming videos.

Transcripts

play00:01

Hello everyone,

play00:02

Welcome to PlotDigitizer.com. In this video tutorial, we will learn how to automatically

play00:09

extract data from the graph image.

play00:13

Before we begin, this tutorial is a part of the video series to a complete guide on PlotDigitizer.

play00:19

If you haven’t watched the earlier videos, please do so.

play00:25

In one of the previous videos, we have experienced extracting data in the manual mode is a very

play00:30

tedious and unexciting exercise. Automating data extraction can save a lot of your time

play00:36

and effort.

play00:41

Before we move forward, let's upload the graph image.

play00:45

As we can see it is an X Y graph.

play00:52

There are two curves. And there are scattered points.

play00:58

First, we have to calibrate this graph.

play01:02

Move the calibration markers X1 and X2 to two different positions where we know X coordinates.

play01:13

And the same for Y1 and Y2.

play01:20

Use the zoom panel while dragging and dropping the markers.

play01:39

Now, the graph has been calibrated. You can see the coordinates of the cursor beneath

play02:00

the zoom panel.

play02:02

To make things simple, automatic extraction, or what I call autotracing, can be divided

play02:08

into three simple steps.

play02:10

The first step in automatic extraction is picking the color. You have to pick the color

play02:16

of the object that you want to extract.

play02:18

So, in this graph, I first wish to extract red colored scattered points.

play02:26

Pick the color of the points using the color picker from the menu bar.

play02:32

Use the zoom panel while color picking.

play02:35

Otherwise, you might end up picking the wrong shade of color. And in that case, the autotracing

play02:40

might not work as expected.

play02:45

Once picked, you can see the color over on the icon.

play02:48

Now, the second step is to add the mask to the region from which we want to extract data.

play02:55

In the case of our example, we have to apply the mask to the entire area occupied by scattered

play03:01

points.

play03:02

Now, there are two types of masks available to us.

play03:06

The first is the pen mask.

play03:09

And the second is the box mask.

play03:12

The box mask, as the name says, overlays the image with a box.

play03:17

You can apply the box multiple times if you want.

play03:23

On the other hand, the pen mask is like a pen.

play03:27

You can control the thickness of the pen mask.

play03:33

You can increase the thickness

play03:40

And You can also decrease the thickness.

play03:50

The pen mask is best for extracting curves, since it gives you more flexibility, while

play03:55

the box mask is perfect in situations where objects are spread over a large portion of

play04:00

the image, like scattered points.

play04:04

The third option is the eraser mask. It is an eraser.

play04:09

And you can also change the size of the eraser with this slider.

play04:13

The erase mask is applied when you want to remove or eliminate the unwanted portion from

play04:18

the masked region.

play04:21

The unwanted portions can be noises, as this legend.

play04:27

Or it could be any interfering objects having the same color.

play04:32

In our graph, this red curve is interfering with the red-colored points.

play04:36

So, let’s erase this curve from the masked portion.

play04:42

The final option clears everything.

play04:49

We wish to extract red colored points first, so let’s mask them.

play05:04

After masking, we have to select the right algorithm.

play05:09

PlotDigitizer is packed with several algorithms, like cluster, points, curves, histogram, etc.

play05:15

There are also vertical and horizontal bar algorithms, which we will cover in one of

play05:22

the next videos. You can find more about each of these algorithms in our official documentation.

play05:28

Visit plotdigitizer.com/docs to know more.

play05:34

Since we are extracting points, we will select the point algorithm.

play05:38

Done. All the points are extracted.

play05:42

Note to consider here, the algorithms are still in the beta phase, so sometimes you

play05:46

might need to do minor adjustments.

play05:50

In our case, there are two points overlapping over here, but they are detected as one single

play05:56

point.

play05:57

You can always manually fix such minor corrections.

play06:05

All the extracted data can be seen on the dataset table.

play06:10

We will save them first and then proceed to the others.

play06:14

We will repeat the same steps for the rest.

play06:26

For the curve, you can increase or decrease the point density with this slider.

play07:09

Now, we have extracted both curves and scattered points. There are other algorithms too, which

play07:44

we will discuss in the next videos.

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
DataExtractionPlotDigitizerAutotracingGraphCalibrationColorSelectionMaskingToolsAlgorithmsDataAnalysisEfficiencyTutorial