-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve HTML filtering rules $$
-- allow CSS-like selectors
#94
Comments
$$
$$
-- allow CCS-like selectors
$$
-- allow CCS-like selectors$$
-- allow CSS-like selectors
Propose to change the current spec in the following way. Replace
with
. Replace:
with:
. Add to the description of rule parts:
. Add to the description of the
. |
I'd like us to do some improvements alongside this.
|
Do they have |
@Alex-302 this is a very good point! Btw, do you think it makes sense to extend |
I'm not sure. Does the length of the removing script or length limit affect applying performance? Will there be support for regular expressions in |
@ngorskikh Which If we can guarantee that the inner one (and not the outer one) will be removed, then I guess it's okay to simply drop Also, we should check existing rules and decide what we do with them. Do we simply remove
Yes, sure. |
@ameshkov In CoreLibs, HTML rules are matched against the unmodified HTML stream, which means that in your example, both By the way, the same sort of ambiguity exists in CSS rules: for example, the |
@ngorskikh makes sense, thank you. @Alex-302 can there be any trouble because of that? Is |
Usually we try to make the matching specific. But sometimes we apply a |
@Alex-302 generally, it does not matter much. Also, if there will be regex support, you can do something like that: |
Ok, that's what I meant by the regexp question. |
|
Is |
No, We may extend it gradually, but please open new issues for every case so that we could choose what to implement. |
Add ability to use syntax, similar to CSS.
For example:
example.com$$td[id="main_content"] > span[id^="post_"][tag-content="advertising"]
and
example.com$$td[id="main_content"] > span[id^="post_"][wildcard="<a href=\"*==""]
and
example.com$$[id="main_content"] > span[id^="post_"][wildcard="<a href=\"*==""]
if tag of
[id="main_content"]
is random(see 4pda.ru)The text was updated successfully, but these errors were encountered: