Skip to content

Open-source Platform for Engineering Neural Architectures and Research Collaboration. Developing and improving AI tools for everyone.

License

Notifications You must be signed in to change notification settings

Infinitode/OPEN-ARC

Repository files navigation

OPEN-ARC title image

OPEN-ARC

Open-source Platform for Engineering Neural Architectures and Research Collaboration

πŸ‘‹ Welcome to OPEN-ARC, an open-source initiative to further AI research through collaboration. This repository contains base model files and folders for various problem sets, each accompanied by a challenge and linked Kaggle notebooks to run the models. Users can link their notebooks to the challenges, share their findings, and help improve AI models to help solve problems in various fields and further research enhancement.

Table of Contents

Introduction

OPEN-ARC aims to democratize AI research by providing a collaborative platform where users can:

  • Access and work on various AI problem sets.
  • Contribute their solutions and improvements.
  • Learn from others' approaches and findings.
  • Help improve AI models for real-world applications and communities.

Installation

To start with OPEN-ARC, clone the repository or start from your notebook locally or in Kaggle or Colab using the provided datasets.

git clone https://github.com/infinitode/open-arc.git
cd open-arc

Tip

You can also download the available Jupyter notebooks and base models. They contain everything you need for that project, if you want to work on specific projects only.

Usage

Each project folder contains our:

  • Base model files.
  • Instructions for running the models.
  • Linked Kaggle notebooks for each challenge.

Note

Base models are not available for all of the projects in OPEN-ARC.

To run a model, you can just navigate to the project folder and follow the instructions in the README.md file located there. You can also run the Kaggle notebook, either locally or in Kaggle/Colab (note that we will primarily use Kaggle Datasets, which means that you'd have to obtain the datasets either from Kaggle, or another source, to run the code locally or in Colab), and follow the steps inside.

We only provide the base models, and the basic code implementations, this is where the power of community comes in. You can either improve the base code or write your own. Then, others can learn from your implementation, therefore furthering research, and helping communities worldwide.

Contributing

We welcome contributions from the community. To contribute to the project:

  1. Fork the repository.
  2. Create a new branch for your implementation.
  3. Commit your changes and push them to your branch.
  4. Create a pull request detailing your changes.

Tip

You can also work on a project and share your results by submitting a link to your notebook. If you'd like to have your entry listed on the leaderboard, fork the repo and update LEADERBOARD.md in your pull request.

To protect our community, please ensure your contributions adhere to our Code of Conduct.

Available Projects

Here are some of the current projects available in OPEN-ARC:

Project 1: Liver Cirrhosis Stage Classification

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model RandomForestClassifier Kaggle βœ— Liver Cirrhosis Stage Classification 🩺 95.6% Notebook

Project 2: Weather Type Classification

  • Challenge: Classify the type of weather using 10 different weather data features.
  • Dataset: Weather Type Classification
  • Kaggle Notebook: Link to Notebook
  • Instructions: Detailed instructions on running the model are available in the project's README.

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model RandomForestClassifier Kaggle βœ— Weather Type Classification 91.2% Notebook

Project 3: Potato Plant Disease Classification

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model CustomCNN Kaggle βœ— 🌱 Potato Plant Diseases Data πŸ‚ 95.1% Notebook

Project 4: Red Wine Quality Classification

  • Challenge: Classify the quality of red wine based on sensory data.
  • Dataset: Red Wine Quality
  • Kaggle Notebook: Link to Notebook
  • Instructions: Detailed instructions on running the model are available in the project's README.

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model GradientBoostingClassifier Kaggle βœ— Red Wine Quality 72.8% Notebook

Project 5: Terraria Weapon Name Generation

  • Challenge: Generate Terraria weapon names based on weapon names from the official game.
  • Dataset: All Terraria Weapons DPS V_1.4.4.9
  • Kaggle Notebook: Link to Notebook
  • Instructions: Detailed instructions on running the model are available in the project's README.

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model SimpleRNN Kaggle βœ” All Terraria Weapons DPS V_1.4.4.9 78.6% Notebook

Project 6: News Headline Generation

  • Challenge: Generate convincing news headlines based on short passages of text.
  • Dataset: NEWS SUMMARY
  • Kaggle Notebook: Link to Notebook
  • Instructions: Detailed instructions on running the model are available in the project's README.

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset BLEU-Score Link
N Our Model DistilBART Kaggle βœ— NEWS SUMMARY 52.8% Notebook

Project 7: Crop Recommendation

  • Challenge: Recommend crops based on a given set of features to optimize crop yields.
  • Dataset: Crop Recommendation Dataset
  • Kaggle Notebook: Link to Notebook
  • Instructions: Detailed instructions on running the model are available in the project's README.

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model XGBClassifier Kaggle βœ” Crop Recommendation Dataset 98.6% Notebook

Project 8: Plant Stress Prediction Model

  • Challenge: Predict a plant's stress level based on certain features.
  • Dataset: Plant-Health-Data
  • Kaggle Notebook: Link to Notebook
  • Instructions: Detailed instructions on running the model are available in the project's README.

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model XGBClassifier Kaggle βœ” Plant-Health-Data 99.1% Notebook

Project 9: Traffic Accident Prediction Model

Leaderboard (Top 5)

Rank Contributor Architecture Type Platform Base Model Dataset Accuracy Link
N Our Model XGBClassifier Kaggle βœ” Traffic Accident Prediction πŸ’₯πŸš— 85.2% Notebook

More projects will be added soon!

Sharing Progress

We encourage users to share their progress and improvements. You can do this in several ways:

  1. Local Notebooks:

    • Save your Jupyter notebooks in the respective project folder and create a pull request with your findings and steps.
  2. Google Colab:

    • Upload your notebooks to Colab, then share the link and a brief description in the project's issue tracker or discussion board.
  3. Kaggle Notebooks:

    • Link your Kaggle notebooks to the respective project by creating a pull request or an issue with the notebook link and a summary of your approach and results.
  4. Notebook Documentation:

    • Include your steps, process, and any issues encountered directly within the notebook. This documentation helps others understand your approach and learn from it.

Feel free to document your process and findings at the bottom of the project README for others to learn from and improve upon.

Uploading Models and Sharing Code

You can also upload your trained models and code to GitHub and share the links in our repository's leaderboard. Here’s how:

  1. Upload Your Model (optional):

    • Save your model files and code in a new repository or in a dedicated branch of your forked repository.
    • Ensure you provide clear documentation and instructions on how to use your model.
  2. Share the Link:

    • Navigate to the LEADERBOARD.md file in the main repository.
    • Add an entry with your notebook's link, a brief description, and your results.
    • Create a pull request with your updates to the LEADERBOARD.md file.
  3. Post on the Leaderboard:

    • Once your pull request is merged, your model and results will be visible on the leaderboard for others to view and collaborate on.

Users can quickly share their contributions and help others in the community learn and improve their models and research.

License

This project is licensed under the MIT License - see the LICENSE file for more details about this repo's license.



We hope OPEN-ARC becomes a thriving community of developers, helping improve AI tools for communities around the world, and drive new research and technology.

Happy coding and collaborating!

~ Infinitode