π Welcome to OPEN-ARC, an open-source initiative to further AI research through collaboration. This repository contains base model files and folders for various problem sets, each accompanied by a challenge and linked Kaggle notebooks to run the models. Users can link their notebooks to the challenges, share their findings, and help improve AI models to help solve problems in various fields and further research enhancement.
- Introduction
- Installation
- Usage
- Contributing
- Available Projects
- Sharing Progress
- Uploading Models and Sharing Code
- License
OPEN-ARC aims to democratize AI research by providing a collaborative platform where users can:
- Access and work on various AI problem sets.
- Contribute their solutions and improvements.
- Learn from others' approaches and findings.
- Help improve AI models for real-world applications and communities.
To start with OPEN-ARC, clone the repository or start from your notebook locally or in Kaggle or Colab using the provided datasets.
git clone https://github.com/infinitode/open-arc.git
cd open-arc
Tip
You can also download the available Jupyter notebooks and base models. They contain everything you need for that project, if you want to work on specific projects only.
Each project folder contains our:
- Base model files.
- Instructions for running the models.
- Linked Kaggle notebooks for each challenge.
Note
Base models are not available for all of the projects in OPEN-ARC.
To run a model, you can just navigate to the project folder and follow the instructions in the README.md file located there. You can also run the Kaggle notebook, either locally or in Kaggle/Colab (note that we will primarily use Kaggle Datasets, which means that you'd have to obtain the datasets either from Kaggle, or another source, to run the code locally or in Colab), and follow the steps inside.
We only provide the base models, and the basic code implementations, this is where the power of community comes in. You can either improve the base code or write your own. Then, others can learn from your implementation, therefore furthering research, and helping communities worldwide.
We welcome contributions from the community. To contribute to the project:
- Fork the repository.
- Create a new branch for your implementation.
- Commit your changes and push them to your branch.
- Create a pull request detailing your changes.
Tip
You can also work on a project and share your results by submitting a link to your notebook. If you'd like to have your entry listed on the leaderboard, fork the repo and update LEADERBOARD.md
in your pull request.
To protect our community, please ensure your contributions adhere to our Code of Conduct.
Here are some of the current projects available in OPEN-ARC:
- Challenge: Predict the stage of liver cirrhosis, to help medical experts quickly assess a patient's condition.
- Dataset: Liver Cirrhosis Stage Classification π©Ί
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | RandomForestClassifier | Kaggle | β | Liver Cirrhosis Stage Classification π©Ί | 95.6% | Notebook |
- Challenge: Classify the type of weather using 10 different weather data features.
- Dataset: Weather Type Classification
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | RandomForestClassifier | Kaggle | β | Weather Type Classification | 91.2% | Notebook |
- Challenge: Classify potato plant diseases based on image/vision data.
- Dataset: π± Potato Plant Diseases Data π
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | CustomCNN | Kaggle | β | π± Potato Plant Diseases Data π | 95.1% | Notebook |
- Challenge: Classify the quality of red wine based on sensory data.
- Dataset: Red Wine Quality
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | GradientBoostingClassifier | Kaggle | β | Red Wine Quality | 72.8% | Notebook |
- Challenge: Generate Terraria weapon names based on weapon names from the official game.
- Dataset: All Terraria Weapons DPS V_1.4.4.9
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | SimpleRNN | Kaggle | β | All Terraria Weapons DPS V_1.4.4.9 | 78.6% | Notebook |
- Challenge: Generate convincing news headlines based on short passages of text.
- Dataset: NEWS SUMMARY
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | BLEU-Score | Link |
---|---|---|---|---|---|---|---|
N | Our Model | DistilBART | Kaggle | β | NEWS SUMMARY | 52.8% | Notebook |
- Challenge: Recommend crops based on a given set of features to optimize crop yields.
- Dataset: Crop Recommendation Dataset
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | XGBClassifier | Kaggle | β | Crop Recommendation Dataset | 98.6% | Notebook |
- Challenge: Predict a plant's stress level based on certain features.
- Dataset: Plant-Health-Data
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | XGBClassifier | Kaggle | β | Plant-Health-Data | 99.1% | Notebook |
- Challenge: Predict traffic accidents based on certain features.
- Dataset: Traffic Accident Prediction π₯π
- Kaggle Notebook: Link to Notebook
- Instructions: Detailed instructions on running the model are available in the project's README.
Rank | Contributor | Architecture Type | Platform | Base Model | Dataset | Accuracy | Link |
---|---|---|---|---|---|---|---|
N | Our Model | XGBClassifier | Kaggle | β | Traffic Accident Prediction π₯π | 85.2% | Notebook |
More projects will be added soon!
We encourage users to share their progress and improvements. You can do this in several ways:
-
Local Notebooks:
- Save your Jupyter notebooks in the respective project folder and create a pull request with your findings and steps.
-
Google Colab:
- Upload your notebooks to Colab, then share the link and a brief description in the project's issue tracker or discussion board.
-
Kaggle Notebooks:
- Link your Kaggle notebooks to the respective project by creating a pull request or an issue with the notebook link and a summary of your approach and results.
-
Notebook Documentation:
- Include your steps, process, and any issues encountered directly within the notebook. This documentation helps others understand your approach and learn from it.
Feel free to document your process and findings at the bottom of the project README for others to learn from and improve upon.
You can also upload your trained models and code to GitHub and share the links in our repository's leaderboard. Hereβs how:
-
Upload Your Model (optional):
- Save your model files and code in a new repository or in a dedicated branch of your forked repository.
- Ensure you provide clear documentation and instructions on how to use your model.
-
Share the Link:
- Navigate to the
LEADERBOARD.md
file in the main repository. - Add an entry with your notebook's link, a brief description, and your results.
- Create a pull request with your updates to the
LEADERBOARD.md
file.
- Navigate to the
-
Post on the Leaderboard:
- Once your pull request is merged, your model and results will be visible on the leaderboard for others to view and collaborate on.
Users can quickly share their contributions and help others in the community learn and improve their models and research.
This project is licensed under the MIT License - see the LICENSE file for more details about this repo's license.
We hope OPEN-ARC becomes a thriving community of developers, helping improve AI tools for communities around the world, and drive new research and technology.
Happy coding and collaborating!
~ Infinitode