Skip to content
Change the repository type filter

All

    Repositories list

    • Instruct-to-SPARQL is a dataset that consists of pairs of Natural language instructions and SPARQL queries. The dataset is created by crawling Wikipedia pages and tutorials for real examples of WikiData SPARQL queries.
      Jupyter Notebook
      Apache License 2.0
      0100Updated Nov 5, 2024Nov 5, 2024
    • searchlab

      Public
      Exploring Conversational and Traditional Search Interfaces in Information Retrieval
      TypeScript
      Apache License 2.0
      1000Updated Nov 5, 2024Nov 5, 2024
    • Accompanying code for the paper "Can GPT-4 Replace Human Examiners? A Competition on Checking Open-Text Answers"
      Python
      Apache License 2.0
      0000Updated Oct 21, 2024Oct 21, 2024
    • Python
      124100Updated Sep 25, 2024Sep 25, 2024
    • Agent4DL is a Python-based simulation framework designed to model user search behavior using agentic large language models (LLMs).
      Python
      1000Updated Aug 22, 2024Aug 22, 2024
    • Python
      Apache License 2.0
      0100Updated Jul 7, 2024Jul 7, 2024
    • Accompanying code for the paper "Performance analysis of large language models in the domain of legal argument mining"
      Python
      0000Updated Jun 22, 2024Jun 22, 2024
    • This study compares EconBiz (private) and SUSS (public) datasets, revealing EconBiz's more detailed user interactions. It highlights the scarcity of public datasets with rich user data in digital libraries and explores using LLMs to simulate detailed interactions while preserving anonymity, aiming to enhance public datasets for improved research.
      Python
      1000Updated May 17, 2024May 17, 2024
    • Classify webpages using their URLs
      Python
      Apache License 2.0
      0100Updated May 6, 2024May 6, 2024
    • Code to crawl content of a dataset of URLs
      Python
      Apache License 2.0
      0100Updated May 1, 2024May 1, 2024
    • Official implementation of "Intra-Class Similarity-Guided Feature Distillation" accepted in NeurIPS-ENLSP 2023
      Python
      0100Updated Apr 26, 2024Apr 26, 2024
    • Official implementation of "Using Pre-Trained Language Models in an End-to-End Pipeline for Antithesis Detection" accepted in LREC-2024
      Python
      0100Updated Apr 18, 2024Apr 18, 2024
    • Official implementation of "Learn From One Specialized Sub-Teacher: One-to-One Mapping for Feature-Based Knowledge Distillation" accepted in EMNLP-Findings 2023
      Python
      0100Updated Apr 18, 2024Apr 18, 2024
    • krony-PT

      Public
      Compressing GPT2 using Kronecker products.
      Python
      0100Updated Apr 7, 2024Apr 7, 2024
    • memBERT

      Public
      Source code for the MemBERT paper
      MIT License
      0100Updated Mar 27, 2024Mar 27, 2024
    • Code for the paper 'A Longitudinal Study of Content Control Mechanisms' presented at the TempWeb workshop (WWW'24)
      Java
      0200Updated Mar 19, 2024Mar 19, 2024
    • A Longitudinal study of robots.txt files extracted from the Common Crawl web archive
      HTML
      0100Updated Mar 10, 2024Mar 10, 2024
    • simiir-2

      Public
      SimIIR 2.0 extends the Python-based SimIIR framework for simulating interactive information retrieval (IIR).
      Click
      MIT License
      1500Updated Nov 22, 2023Nov 22, 2023
    • pypads

      Public
      Building on the MLFlow toolset this project aims to extend the functionality for MLFlow, increase the automation and therefore reduce the workload for the user. The production of structured results is an additional goal of the extension.
      Python
      GNU General Public License v3.0
      41451Updated Apr 2, 2023Apr 2, 2023
    • pypadre

      Public
      In this research project, we aim to create an environment to gather structured data about machine learning experiments in order to analyze data and algorithmich dependencies.
      Python
      MIT License
      1304Updated Nov 17, 2022Nov 17, 2022
    • An extension of pypads tracking machine learning workflows (steps and concepts). Most of the concepts are derived from PaDRe.
      Python
      GNU General Public License v3.0
      1200Updated May 18, 2021May 18, 2021
    • Helm Chart & Documentation for deploying JupyterHub on Kubernetes
      Python
      Other
      799000Updated May 17, 2021May 17, 2021
    • Web Page for the PAssau Data science REsearch Lab
      HTML
      0000Updated Jan 5, 2021Jan 5, 2021
    • Python file examples for PyPads
      Jupyter Notebook
      0000Updated Dec 18, 2020Dec 18, 2020
    • Extension for ontology integrations
      Python
      GNU General Public License v3.0
      1200Updated Dec 17, 2020Dec 17, 2020
    • brwsr

      Public
      Lightweight Linked Data Browser
      Python
      10000Updated Dec 17, 2020Dec 17, 2020
    • Jupyter Notebook
      0000Updated Nov 18, 2020Nov 18, 2020
    • Production ready docker-compose configuration for ML Flow with Mysql and Minio S3
      Shell
      MIT License
      88000Updated Sep 6, 2020Sep 6, 2020
    • This repository includes a set of examples how to use pypads
      Python
      0000Updated Jun 22, 2020Jun 22, 2020
    • This repository shows examples how PyPads can be used with a jupyter notebook.
      Jupyter Notebook
      0000Updated Jun 22, 2020Jun 22, 2020