Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve viewer performance to better handle large result sets #3

Open
ghostwords opened this issue Mar 20, 2015 · 2 comments
Open

Improve viewer performance to better handle large result sets #3

ghostwords opened this issue Mar 20, 2015 · 2 comments

Comments

@ghostwords
Copy link
Owner

Once you get into tens of thousands of URLs, the viewer starts taking too long to render results.

The backend becomes slow/starts using a lot of RAM (bad, but OK for a single-user app) and the results page gets too big to render in a reasonable amount of time (much worse since it makes the viewer unusable past a certain point).

@ghostwords ghostwords added the bug label Mar 20, 2015
@cooperq
Copy link

cooperq commented Mar 24, 2015

I would say that we need to be able to handle about 100,000 domains. Here are some links that might be useful:
http://w2ui.com/web/blog/7/JavaScript-Grid-with-One-Million-Records
http://misoproject.com/dataset/

@ghostwords ghostwords self-assigned this Mar 24, 2015
@ghostwords
Copy link
Owner Author

Viewer redesign plan:

  • The list of crawls goes away, at least in its current form as the front page.
  • The results screen becomes the front page, with search filters expanded to include the list of crawls.
  • The results will no longer show script-level data. All data will get collapsed to domain level.
  • The results page will probably be a dynamically-rendered, sortable table. Loading visible rows only (occlusion rendering?) is like pagination 2.0: performance w/o the extra burden of pagination controls.
  • There will be a new domain detail report page, reachable from the main results page. This page will present script-level findings and provide additional information such as exact properties accessed, etc.

@ghostwords ghostwords removed their assignment Apr 24, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants