Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrate an HTML parser with XPath 2 support #702

Open
vdusek opened this issue Nov 15, 2024 · 0 comments
Open

Integrate an HTML parser with XPath 2 support #702

vdusek opened this issue Nov 15, 2024 · 0 comments
Labels
t-tooling Issues with this label are in the ownership of the tooling team.

Comments

@vdusek
Copy link
Collaborator

vdusek commented Nov 15, 2024

  • Both Parsel and BeautifulSoup (lxml) support only XPath 1.
  • Research and identify an HTML parser with support for XPath 2.
  • If such a parser exists, explore integrating it into Crawlee, potentially as a new crawler type.
@github-actions github-actions bot added the t-tooling Issues with this label are in the ownership of the tooling team. label Nov 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
t-tooling Issues with this label are in the ownership of the tooling team.
Projects
None yet
Development

No branches or pull requests

1 participant