Scrape website data without code using Bardeen

Harv the workload
15 Jan 2023 10:12

Summary

TLDR: In this tutorial, the speaker introduces Bardeen, a free Chrome extension for automating repetitive manual tasks, with a focus on data scraping. The tool is useful for automating copy-and-paste work and for setting up triggered workflows. The demonstration shows how Bardeen can extract data from listings on Rightmove, a property search website, and collect it in a spreadsheet. The step-by-step guide covers setting up a scraper template, creating an Autobook triggered by a right-click, and saving the scraped data to Google Sheets. The video concludes by encouraging viewers to explore Bardeen for market research and data analysis.

Takeaways

  • 🕵️ The speaker spends a lot of time scraping data from websites and prefers using APIs or other methods when available.
  • 🔧 Bardeen is a free Chrome extension that automates repetitive tasks and is particularly useful for data scraping and workflow setups.
  • 🏠 The speaker uses Bardeen to scrape data from e-commerce platforms and courier services to streamline order creation and shipment tracking.
  • 🔗 Bardeen can extract specific data from a webpage, such as property details, by identifying HTML elements and creating a scraper template.
  • 📈 The speaker demonstrates using Bardeen on Rightmove, a property search website, to extract information from property listings.
  • 📝 Bardeen allows users to create an Autobook with a trigger and actions, enabling automated data scraping when certain conditions are met.
  • 🔗 The Autobook can be set to scrape data in the background when a user right-clicks on a link, without loading the page.
  • 📋 Data scraped by Bardeen can be saved directly to Google Sheets, streamlining the process of data collection and analysis.
  • 🛠 The speaker emphasizes the flexibility of Bardeen, suggesting it can be used for various tasks beyond the examples provided.
  • 📊 Bardeen is recommended for those conducting market research or needing to pull large datasets into a spreadsheet for analysis.

Q & A

  • What is the main purpose of the tool Bardeen mentioned in the script?

    -Bardeen is a free Chrome extension designed to automate repetitive manual tasks, particularly useful for data scraping and setting up triggered workflows on websites.

  • Why might someone prefer using an API over scraping data from websites?

    -Using an API is often preferred because it provides a more structured and reliable way to access data from websites, avoiding the need for manual scraping which can be time-consuming and less efficient.

  • How does Bardeen help with scraping data from e-commerce platforms?

    -Bardeen can be used to scrape data from e-commerce platforms to create orders in one's system without the need to build a custom integration, streamlining the process.

  • What is an example of a website where Bardeen can be used to extract information?

    -The script mentions using Bardeen on Rightmove, a property search website, to extract information from standard property pages.

  • What specific data points can Bardeen help extract from a webpage?

    -Bardeen can help extract specific data points such as address, price, property type, number of bedrooms, number of bathrooms, property size, and tenure.

  • How does Bardeen identify specific HTML elements to scrape?

    -Bardeen identifies specific HTML elements by allowing the user to click on them and assign a title, which helps in customizing the data extraction process.

  • What additional feature did the speaker find useful to add to Bardeen's data extraction?

    -The speaker found it useful to add the URL and timestamp to the data extraction, providing a direct link to the property and indicating when the data was scraped.

  • How can Bardeen's auto book feature be triggered?

    -The auto book feature in Bardeen can be triggered by a user action, such as a right-click on a link, which initiates the scraping process.

  • What is the process of saving scraped data to Google Sheets using Bardeen?

    -After setting up the scraper and auto book in Bardeen, the user can add an action to save the scraped data to a Google Sheet by specifying the sheet and how the data should be added.

  • How can Bardeen help in analyzing large datasets for market research?

    -Bardeen can help by automating the process of pulling data into a spreadsheet, allowing for quicker analysis of large datasets, which is particularly useful for market research.

Outlines

00:00

🤖 Introduction to Bardeen Tool for Web Scraping

The speaker introduces Bardeen, a free Chrome extension that automates repetitive tasks, particularly useful for web scraping. They express their preference for APIs but acknowledge the necessity of scraping in certain situations. The tool is highlighted for its ability to streamline the process of copying and pasting data or setting up triggered workflows. The speaker shares their experience using Bardeen with partner platforms to scrape e-commerce data for order creation and to pull courier data for shipment tracking. The tutorial begins with a demonstration on the Rightmove property search website, aiming to extract property information efficiently.

05:02

🔍 Setting Up Bardeen for Property Data Scraping

The speaker demonstrates how to set up Bardeen for scraping property data from Rightmove. They guide through the process of initiating the app, creating a new scraper template, and identifying specific HTML elements to extract data such as address, price, property type, number of bedrooms and bathrooms, size, tenure, and a special field for the URL and timestamp. The focus is on creating a single page scraper tailored to gather the desired information from property listings. The setup includes an auto book feature triggered by a right-click action, which is intended to scrape data from the selected property page in the background.
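
Bardeen builds its scraper template by point-and-click, so no code appears in the video. Purely as an illustration of what such a template captures, here is a minimal Python sketch, assuming each field sits in an element with a known `id`. The `id` values, the `TEMPLATE` mapping, `FieldExtractor`, and `scrape_page` are all hypothetical; they are not Rightmove's markup or Bardeen's internals.

```python
from datetime import datetime, timezone
from html.parser import HTMLParser

# Hypothetical template: field title -> id of the element holding the value.
TEMPLATE = {
    "Address": "address",
    "Price": "price",
    "Property type": "property-type",
    "Bedrooms": "bedrooms",
    "Bathrooms": "bathrooms",
    "Size": "size",
    "Tenure": "tenure",
}

# Tags without a closing tag; ignored so nesting depth stays balanced.
VOID_TAGS = {"br", "img", "input", "hr", "meta", "link"}


class FieldExtractor(HTMLParser):
    """Collects the text inside elements whose id appears in the template."""

    def __init__(self, template):
        super().__init__()
        self.id_to_title = {elem_id: title for title, elem_id in template.items()}
        self.values = {}
        self.current = None  # title of the field currently being captured
        self.depth = 0       # nesting depth inside the captured element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.current is not None:
            self.depth += 1
            return
        title = self.id_to_title.get(dict(attrs).get("id"))
        if title is not None:
            self.current, self.depth = title, 1
            self.values[title] = ""

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self.current is None:
            return
        self.depth -= 1
        if self.depth == 0:
            self.values[self.current] = self.values[self.current].strip()
            self.current = None

    def handle_data(self, data):
        if self.current is not None:
            self.values[self.current] += data


def scrape_page(html, url, scraped_at=None):
    """Return one row: the template fields plus the page link and a timestamp."""
    parser = FieldExtractor(TEMPLATE)
    parser.feed(html)
    row = {title: parser.values.get(title, "") for title in TEMPLATE}
    row["Page link"] = url
    row["Timestamp"] = (scraped_at or datetime.now(timezone.utc)).isoformat()
    return row
```

Calling `scrape_page(html, url)` on a listing page would return one dictionary per property. Fields the page does not contain come back as empty strings, and the timestamp makes it possible to spot listings that were scraped long ago and may no longer be available.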

10:03

📊 Automating Data Extraction and Integration with Google Sheets

The speaker continues by detailing the automation process of data extraction using Bardeen and integrating it with Google Sheets. They explain creating an action that triggers upon a right-click, which then scrapes the data from the property page in the background. The data is then saved to a spreadsheet, streamlining the process of compiling property listings. The tutorial includes connecting Bardeen with Google Sheets, setting up the action to add rows to the spreadsheet, and testing the workflow to ensure data is correctly pulled and organized. The speaker concludes by encouraging viewers to explore Bardeen's capabilities for their own data scraping and market research needs, highlighting its utility beyond the demonstrated use case.
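
To make the "add rows" step concrete, the following Python sketch appends each scraped row to a local CSV file, which stands in for the Google Sheet. This is an assumption for illustration only: it does not use Bardeen or the Google Sheets API, and `append_row` is a hypothetical helper. It mirrors the behaviour seen in the video, where the column headers are created the first time a row is saved.

```python
import csv
import os


def append_row(path, row):
    """Append one scraped row; write the header only if the file is new or empty."""
    is_new = not os.path.exists(path) or os.path.getsize(path) == 0
    with open(path, "a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=list(row))
        if is_new:
            writer.writeheader()
        writer.writerow(row)
```

Each right-click in the video corresponds to one call such as `append_row("listings.csv", row)`, so the file grows by one line per saved listing.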

Keywords

💡Scraping data

Scraping data refers to the process of extracting information from websites programmatically. In the context of the video, the speaker is not satisfied with the manual process of scraping data from websites and is seeking more efficient methods. The term is central to the video's theme as it introduces the need for automation tools like Bardeen to streamline the data extraction process.

💡API

API stands for Application Programming Interface, which is a set of rules and protocols that allows different software applications to communicate with each other. The video mentions APIs as a preferred method for engaging with websites over manual scraping, indicating that APIs are more efficient and often provide a structured way to access data.

💡Bardeen

Bardeen is a free Chrome extension mentioned in the video that automates repetitive tasks, particularly useful for data scraping and workflow automation. It is highlighted as a solution to the problem of inefficient manual data scraping, showcasing its power in automating tasks on websites like Rightmove for property data extraction.

💡E-commerce platforms

E-commerce platforms are online systems that allow for the buying and selling of goods and services. The script refers to using Bardeen to scrape data from e-commerce platforms to create orders in their system, which exemplifies a practical application of Bardeen in automating data transfer for business processes.

💡Courier data

Courier data pertains to information related to the shipment and delivery of goods by courier services. The video script mentions using Bardeen to pull courier data to keep track of shipments, illustrating how the tool can be used to automate and update logistics information.

💡Rightmove

Rightmove is a UK-based property search website. The video script uses Rightmove as an example to demonstrate how Bardeen can be used to scrape data from property listings. This keyword is significant as it provides a specific context for showcasing Bardeen's capabilities.

💡Scraper template

A scraper template in the context of Bardeen is a pre-configured set of instructions that dictate how data should be extracted from a webpage. The video describes creating a 'Rightmove single page' scraper template to define the structure and data points to be scraped from Rightmove's property listings.

💡Autobook

An Autobook in Bardeen is a feature that automates tasks based on triggers. The video sets up an Autobook with a right-click trigger on webpage links, which automates scraping data from the selected properties.
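
The trigger-then-actions pattern described here can be sketched in a few lines of Python. This is only a conceptual model under stated assumptions: `TriggeredWorkflow`, the event name `right_click_on_link`, and the two placeholder actions are hypothetical and do not reflect how Bardeen is implemented.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List


@dataclass
class TriggeredWorkflow:
    """A named trigger plus an ordered list of actions (hypothetical model)."""
    name: str
    trigger: str
    actions: List[Callable[[Any], Any]] = field(default_factory=list)

    def fire(self, event: str, payload: Any) -> Any:
        """Run every action in order, but only if the event matches the trigger."""
        if event != self.trigger:
            return None
        result = payload
        for action in self.actions:
            result = action(result)  # each action receives the previous output
        return result


saved_rows = []  # stand-in for the spreadsheet


def scrape_in_background(url: str) -> dict:
    # Placeholder: a real scraper would fetch and parse the linked page.
    return {"Page link": url}


def add_row(row: dict) -> dict:
    saved_rows.append(row)
    return row


workflow = TriggeredWorkflow(
    name="RM data extractor",
    trigger="right_click_on_link",
    actions=[scrape_in_background, add_row],
)
```

Calling `workflow.fire("right_click_on_link", url)` passes the link's URL to the first action and hands each action's output to the next, which matches the video's flow of using the trigger's URL as the page to scrape and then saving the result.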

💡Google Sheets

Google Sheets is a web-based spreadsheet program that is part of Google's office suite. The video script explains how to use Bardeen to export scraped data directly into a Google Sheet, which is a key feature for organizing and analyzing the extracted data.

💡Integration

Integration in the context of the video refers to the process of combining different systems or applications to work together. The speaker mentions avoiding the need to build an integration by using Bardeen to scrape data from partner platforms, which simplifies the process of data transfer between systems.

💡Workflow automation

Workflow automation means having a sequence of tasks within a business or system run automatically rather than by hand. The video demonstrates how Bardeen can automate workflows, such as scraping property data and saving it to a spreadsheet, to increase efficiency and reduce manual labor.

Highlights

Scraping data from websites can be time-consuming and frustrating without an API.

Bardeen is a free Chrome extension for automating repetitive tasks, especially for data scraping.

The speaker uses Bardeen to scrape data from e-commerce platforms for order creation.

Bardeen is also used to pull courier data to keep track of shipments.

A demonstration of using Bardeen on Rightmove, a property search website, is provided.

Bardeen allows for the creation of a new scraper template for specific tasks.

HTML elements on a webpage can be identified and customized for data extraction.

Including the URL and timestamp in the data scrape can be useful for tracking property listings.

Autobooks in Bardeen can be triggered by specific actions, like a right-click on a link.

Creating actions within Bardeen to scrape data in the background without loading the page.

Testing the Bardeen setup to ensure the data is being scraped correctly.

Integrating Bardeen with Google Sheets to automatically save scraped data.

The process of connecting Bardeen with Google Sheets for the first time requires agreeing to terms.

Using Bardeen to add rows to a Google Sheet with data from the scraper.

The speaker emphasizes the potential of Bardeen for market research and data analysis.

Bardeen can automate the process of pulling large datasets into a spreadsheet for analysis.

The speaker invites viewers to share their creative uses of Bardeen in the comments.

Transcripts

play00:00

I spend an awful lot of time

play00:02

scraping data from websites there I've

play00:04

said it I'm not happy about it I wish

play00:06

there were better ways to do it and

play00:07

often when there's an API or any other

play00:10

way for me to engage in a website I'm

play00:11

going to use that instead but sometimes

play00:13

it's not avoidable and sometimes it's

play00:16

not worth the time this is where

play00:17

something like bardeen the tool I'm

play00:19

going to be talking about today is

play00:20

extremely useful bardeen is a free

play00:23

Chrome extension that packs a lot of

play00:24

power it lets you automate repetitive

play00:26

manual tasks and it's especially useful

play00:28

for copying and pasting or setting up

play00:29

triggered workflows obviously this video

play00:32

isn't sponsored but I have left a link

play00:34

in the description so you can check it

play00:35

out for yourself now I've been using

play00:36

this recently with some of our partner

play00:39

platforms to scrape data from their

play00:41

e-commerce platforms so that we can

play00:43

create orders in our system without

play00:44

having to build an integration I've also

play00:46

used it to pull courier data so that we

play00:48

can keep an update on all of our

play00:50

shipments that are happening at any time

play00:52

today though I'm going to use this on

play00:53

Rightmove which is a property search

play00:55

website I'm keen to get some information

play00:57

out of their standard property page and

play01:00

a tool like bardeen is really really

play01:01

great for that so let's waste no more time

play01:04

and I will jump into the platform and

play01:05

show you how it works

play01:08

okay so here we are on the Rightmove

play01:10

site you'll see there's a nice big old

play01:11

search box here when I went to

play01:13

University I lived in Leamington Spa so

play01:15

that seems like a great place for us to

play01:16

start so let's just have a really quick

play01:18

look Leamington, Leamington Spa in

play01:22

Warwickshire I'm gonna look at houses

play01:24

for sale and I'm just going to use

play01:26

whatever standard setting they have I

play01:29

just want to get a list of links and

play01:30

have a go so I've done my search now for

play01:33

Leamington Spa I've got 360 results I

play01:35

don't want all of them what I want to do

play01:37

is just make sure that we get the

play01:38

scraping set up properly

play01:40

in a way that's the most useful so the

play01:42

way I'm going to start this is by

play01:44

clicking one of these links and taking a

play01:45

look at the page so loads of pictures

play01:47

which is great we've got these key

play01:49

features here which may be useful the

play01:51

property description I'm not too fussed

play01:53

about but there's definitely some

play01:55

things that I want to make sure that we

play01:56

have for our spreadsheet but yeah let's

play02:00

let's have a look and see what data we

play02:03

can pull out so all you need to do to

play02:04

start up the app itself is click

play02:06

here on your extension list and that's

play02:09

going to come up with this splash screen

play02:11

for now all I want to do is play around

play02:12

with the scraper and then we can worry

play02:14

about linking that to Google Sheets a

play02:15

little bit later so I'm going to come in

play02:17

here and click on new scraper template

play02:19

that's going to offer either a list

play02:21

table or single page but for now we're

play02:23

on a single page so I'm going to grab

play02:25

this I'm going to call this Rightmove

play02:28

single page

play02:30

start building so this bit down at the

play02:33

bottom is going to offer me some

play02:35

information from the table but what

play02:37

we're going to do first off is just have

play02:39

a look around the page and see what we

play02:41

can click on see what information we can

play02:42

actually pull out so you'll see here it's

play02:44

actually identifying specific HTML

play02:47

elements that we want to look at all you

play02:50

have to do is click on it and give the

play02:51

field a title I'm going to call this

play02:53

address we're going to add here which is

play02:55

the list price

play02:58

property type

play03:02

number of bedrooms

play03:05

the number of bathrooms

play03:08

and the size

play03:12

tenure

play03:14

pretty sure they're all freehold but

play03:15

I don't really know enough to say that

play03:17

with any certainty Harvey from very

play03:20

slightly in the future here uh one of

play03:22

the things that we didn't add here which

play03:24

is obviously very useful to add is the

play03:26

URL you'll see down here and special

play03:28

field URL time Etc generally it's quite

play03:31

nice to add that so I am going to get a

play03:33

page link which is the URL and I am

play03:36

actually going to have a look and see

play03:37

about maybe just also grabbing time

play03:39

scraped the only reason I'm going to

play03:41

grab this is just because obviously

play03:43

we're looking at properties and if

play03:44

something's really old the chances are

play03:46

that it's probably gone so I'm just

play03:47

going to call that timestamp those are

play03:49

now added at the bottom here and it's

play03:51

just going to give us a little bit more

play03:52

information and make it a little bit

play03:53

easier if we did want to adjust the

play03:55

scraper in the future to go back and add

play03:57

some more information

play03:58

[Music]

play04:02

scraper set up here you can see it in

play04:05

this drop down for all of our scrapers

play04:06

but it's not very useful on its own so

play04:08

we need to combine it with a couple of

play04:09

other features within bardeen to make it

play04:11

really really useful and actually do

play04:13

what we need it to do so without further

play04:15

ado I'm going to start an auto book so

play04:17

when you create an auto book it's going

play04:18

to ask you first for a trigger the

play04:20

trigger that we're using is when

play04:21

something happens because we want it to

play04:23

happen as soon as I right click on one

play04:25

of the links so you'll see some of the

play04:27

options here that you can use which

play04:29

again is pretty cool when an email is

play04:31

starred that you can set a specific

play04:33

series of events that might be something

play04:35

that we would use to any starred emails

play04:38

become projects in Asana so any anything

play04:40

immediately gets taken into Asana and

play04:42

then we can use that to manage our

play04:43

projects the things I found it most

play04:44

useful for are right clicks that's why

play04:46

I'm sticking with that so you can just

play04:47

search here for right right even when I

play04:51

right click on a website it's going to

play04:52

ask you for a name so I'm going to call

play04:55

this save listing so when I right click

play04:59

that's what it's going to come up with

play04:59

and it asks you where it should be

play05:01

visible on and you can say it's only

play05:03

visible on specific bits of the site so

play05:05

if you only wanted images or any other

play05:07

page body in this case we only actually

play05:09

want on links because we want to use the

play05:11

link as the page that should be scraped

play05:13

rather than the page that we're already

play05:15

on so now we've got this first piece of

play05:17

that flow here what I can then do is

play05:19

click on this plus button and that will

play05:20

allow me to create an action to happen

play05:23

as a result of what we've just done so

play05:25

I'm going to click new action here and

play05:27

now we are going to scrape we're given

play05:29

three options for what we want to scrape

play05:32

here I'm going to scrape data in the

play05:34

background because we don't actually

play05:35

want to load the page we just want to

play05:37

run through the list right click on each

play05:38

one and scrape it directly from that

play05:40

page and then save that to a sheet so to

play05:42

scrape data in the background it's going

play05:44

to ask us which page should be scraped

play05:46

now you'll see here we have Trigger

play05:48

action from our previous step so when I

play05:50

right click on a website I want to take

play05:52

that URL and scrape that page in

play05:54

particular then it's going to ask us

play05:55

which scraper template and we have the

play05:58

template that we created earlier Rightmove

play05:59

single page just one I can select that

play06:01

it asks for the number of list items to

play06:03

be extracted but actually that doesn't

play06:04

really matter for what we're doing

play06:06

because it's not a table scraping setup

play06:09

it's actually just a single page so I'm

play06:12

going to ignore that and I'm going to

play06:13

click that as done so now that we've got

play06:15

all the data here we can click done save

play06:17

it and test that it's working so I'm

play06:19

going to call this RM data extractor

play06:23

I'm going to click on Save and that's

play06:25

going to bring me back to this main page

play06:26

I can activate this now which is exactly

play06:28

what I'm going to do so I'm going to

play06:29

right click on left click here even this

play06:32

has now been scheduled and it will now

play06:33

run automatically on the page so I can

play06:35

hide bardeen and we can come back to our

play06:38

property here and because this is all a

play06:40

link I can right click anywhere I now

play06:42

have this wonderful piece here that says

play06:45

save listing if I click on that that's

play06:47

going to run in the background I'm not

play06:49

going to see anything happen but what I

play06:50

can then do is reopen bardeen and have a

play06:53

look at the activity now I can see

play06:55

called with here and the returned data

play06:58

and I can see all of the data that I

play07:00

wanted fantastic news has been pulled

play07:02

out really pleased that that's working

play07:03

we'll get on now to the final part which

play07:06

is pulling that data from the scraper

play07:08

into our spreadsheet

play07:12

I'm just bringing back up the bardeen

play07:14

workflow again we've got my auto book

play07:17

that we created earlier which I've just

play07:18

called RM now it is running it's getting

play07:21

the data out and we're quite happy with

play07:22

it but we want to take it a step further

play07:24

and add it to this Rightmove scraping

play07:25

data spreadsheet if you've not connected

play07:28

the two platforms together before you

play07:30

will have to

play07:31

um agree to a couple of terms just to

play07:33

allow bardeen to talk with Google Sheets

play07:35

it's the same with any any of the other

play07:37

apps so I've got that set up already so

play07:40

I'm going to come into more options

play07:42

and open in Builder again we've got our

play07:44

lovely two steps here and we're going to

play07:46

add another action at the end here we've

play07:49

got the two sections here from what

play07:50

we've created already I'm going to click

play07:52

on the add action once more and open up

play07:55

another action I've already connected

play07:57

Google Sheets so I can just type in

play07:59

sheet and add rows to Google sheet I

play08:01

think that option will still appear but

play08:03

it will just take you through the sign

play08:04

in and set up flow if you've not done it

play08:06

before so I'm going to add row to Google

play08:07

sheet and it's going to ask me which

play08:09

Google sheet I've got my sheet ready to

play08:11

go

play08:13

which is Rightmove that's going to

play08:14

bring it up fantastic and then here it's

play08:16

going to ask for how we want the data to

play08:19

come in

play08:20

now it says from table you can add

play08:22

column and or you can use commands and

play08:25

if I come to use commands it's going to

play08:27

give me some options including using the

play08:29

data from action one to add directly to

play08:33

the table so I'm just going to click on

play08:34

that and that's actually all we need to

play08:36

do I'm going to click on done and click

play08:39

on done up here because we now have

play08:41

everything we need to activate this and

play08:43

start collating data

play08:46

so I can close this out auto book now

play08:49

entirely and I can run through here and

play08:51

I can say oh this one looks great I'm

play08:53

just going to grab this save listing

play08:56

and then we can take a look and see if

play08:58

it actually saves it

play09:01

and there we go all of that data is now

play09:03

coming in and we have set up the

play09:06

columns because it was the first time

play09:08

we've done it

play09:09

but just to show you how this would work

play09:10

then now that we've got that data in

play09:12

here I can come down I can see this one

play09:13

looks really nice and

play09:16

don't like this one but I like this one

play09:21

I'm not buying any of these houses these

play09:23

are very expensive houses

play09:26

and I have no money

play09:29

so I've right clicked a load of times

play09:30

now and I'm just going to come back into

play09:32

here so here are the listings in the

play09:35

sheet as I want it I have gone back

play09:37

afterwards and added the URL in to the

play09:40

same setup that I had before because I

play09:42

want to be grabbing that as well and

play09:43

keeping that every time that I pull any

play09:45

any information in my link I think

play09:47

there's some really cool stuff that you

play09:48

can do with this way beyond what I've

play09:49

done today but I would really recommend

play09:51

giving it a try especially if you're

play09:53

pulling data or doing market research it

play09:55

can be really helpful to pull everything

play09:57

into a spreadsheet and then allow you to

play09:59

much more quickly analyze a large data

play10:01

set I hope you found this interesting

play10:03

and I can't wait to see what you decide

play10:05

to do with bardeen do let me know in the

play10:07

comments if you get anything cool

play10:08

working I'd love to explore some more

play10:09

opportunities to use that software


Related Tags
Bardeen Tool, Data Scraping, Automation, Rightmove, Workflow, Chrome Extension, E-commerce, Integration, Property Search, Google Sheets