Suggest me a better approach for indexing in janus. #3916
Replies: 2 comments 6 replies
-
Can you explain how you perform the indexing? Do you have some automated process in place which dynamically creates new indexes? The usual approach is to first create the index and only afterwards insert new data for that index. If the index uses a property key that wasn't in the graph before, then it will be directly enabled. So, you don't have a delay afterwards and can directly insert data with this property which will be also be added to the new index. |
Beta Was this translation helpful? Give feedback.
-
@MounikaMandi how do you manage those 600 keyspaces? From what I understand, Janusgraph can only support one keyspace. |
Beta Was this translation helpful? Give feedback.
-
Hi,
We have cassandra as our storage backend. We have nealry 600 key spaces. Each key space contains 3.5M nodes and 40M relations on an average. We are using elastic search as our mixed index backend.
Let X be my keyspace,Indexing is failing for X ,when ever any read/write operation is getting triggered for that X while it is getting indexed on any new prop.
Currently we are pausing all the ongoing ingestions while indexing and writing it to some intermediate inmemory db . After indexing is done we are reprocessing the stored data.
Some keyspaces with huge data are taking hours of time to get indexed.This is causing much data lag and memory issues at our end. This has been a major challenge at our end.
I wanted to know how to handle this type of scenario?
How to speed up indexing procedure?
Can we configure janus to handle ingestions while getting indexed? Like background indexing?
Any relavant documentation / research paper / resources / suggestions are appreciated to tackle this kind scenario.
Thanks in advance.
Beta Was this translation helpful? Give feedback.
All reactions