Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

whr th rsults ar savd ? and how to add mail in postgrs? #27

Open
TEST11R opened this issue Jan 20, 2024 · 4 comments
Open

whr th rsults ar savd ? and how to add mail in postgrs? #27

TEST11R opened this issue Jan 20, 2024 · 4 comments

Comments

@TEST11R
Copy link

TEST11R commented Jan 20, 2024

image

@TEST11R
Copy link
Author

TEST11R commented Jan 21, 2024

and how to add custom queries? it is only picking up data that is by default even the queries are edited..

@gosom
Copy link
Owner

gosom commented Jan 21, 2024

you are doing something wrong.

the program takes as an input a file .

you need to ensure that this file is the one that you add in the container.

Please follow the documentation exactly .

Additionally, as I wrote you in another ticket I never tried that in windows and I don't plan to.

@TEST11R
Copy link
Author

TEST11R commented Jan 22, 2024

you are doing something wrong.

the program takes as an input a file .

you need to ensure that this file is the one that you add in the container.

Please follow the documentation exactly .

Additionally, as I wrote you in another ticket I never tried that in windows and I don't plan to.

it is working in wsl ubuntu ( WINDOWS) , the inputs are also updated , but we have questions if we can config what data to be scrapped? bcoz it is scrapping reviews and more.
Q1> CAN WE CONFIG IT? //suppose TITLE , LAT + LONG, MAIL ETC.
Q2> HOW TO CONFIGURE RADIUS? MOST SEARCHES ARE ONLY 40-60,,,, HOW CAN WE GET 120 LIKE NORMALLY THE BROWSER DOES? // THE DEPTH IS DEFAULT 10 DOES THAT MAKES MORE SEARCHES SUPPOSE WE SET TO 25 ?
Q3> MAIL SCRAP ONLY GETS 10% OF THE DATA HOW TO GET MORE RATE?

TQ

@gosom
Copy link
Owner

gosom commented Feb 4, 2024

A1) Not it scrapes and writes all the data. You need to manually keep the data you need in your csv. Open with Excel or similar software and delete the unwanted columns or process it with python or bash. I will consider adding a feature to customize the output

A2) Try increasing the depth as you pointed out.

A3) You mean that only 10% of the data have email? The emails are extracted from the website of the place entry (if any). Do the entries you are checking do have a website and an email in their website?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants