Skip to content
@stanford-futuredata

Future Data Systems

We are a CS research group building data-intensive systems

Popular repositories Loading

  1. ColBERT ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 2.8k 365

  2. macrobase macrobase Public

    MacroBase: A Search Engine for Fast Data

    Java 659 126

  3. noscope noscope Public

    Accelerating network inference over video

    Python 436 122

  4. sparser sparser Public

    Sparser: Raw Filtering for Faster Analytics over Raw Data

    C 430 55

  5. ARES ARES Public

    Python 415 47

  6. dawn-bench-entries dawn-bench-entries Public

    DAWNBench: An End-to-End Deep Learning Benchmark and Competition

    Python 259 74

Repositories

Showing 10 of 71 repositories
  • ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    stanford-futuredata/ColBERT’s past year of commit activity
    Python 2,777 MIT 365 69 20 Updated Aug 17, 2024
  • lotus Public

    LOTUS: Enabling Semantic Queries with LLMs Over Tables of Unstructured and Structured Data

    stanford-futuredata/lotus’s past year of commit activity
    Python 76 MIT 6 0 0 Updated Aug 15, 2024
  • stk Public
    stanford-futuredata/stk’s past year of commit activity
    Python 79 Apache-2.0 17 3 1 Updated Aug 9, 2024
  • ARES Public
    stanford-futuredata/ARES’s past year of commit activity
    Python 415 Apache-2.0 47 8 1 Updated Aug 7, 2024
  • gavel Public

    Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

    stanford-futuredata/gavel’s past year of commit activity
    Jupyter Notebook 123 MIT 31 8 2 Updated Jul 25, 2024
  • ACORN Public

    state-of-the-art search over vector embeddings and structured data (SIGMOD '24)

    stanford-futuredata/ACORN’s past year of commit activity
    C++ 35 MIT 4 4 0 Updated Jun 19, 2024
  • FrugalGPT Public

    FrugalGPT: better quality and lower cost for LLM applications

    stanford-futuredata/FrugalGPT’s past year of commit activity
    Python 165 Apache-2.0 17 2 0 Updated May 11, 2024
  • InQuest Public

    Accelerating Aggregation Queries on Unstructured Streams of Data

    stanford-futuredata/InQuest’s past year of commit activity
    Python 7 2 1 0 Updated Apr 18, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    stanford-futuredata/Megatron-LM’s past year of commit activity
    Python 31 2,246 0 2 Updated Jan 19, 2024
  • tasti Public

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data (SIGMOD 2022)

    stanford-futuredata/tasti’s past year of commit activity
    Python 13 5 0 0 Updated Jan 17, 2024

Top languages

Loading…

Most used topics

Loading…