Stock Price Prediction
Yuvraj Tiwary
This project demonstrates how machine learning models can be used to predict stock prices based on historical data, helping investors make informed decisions.
Simple Linear Regression: A basic regression technique that models the relationship between an independent variable (here, the day's opening price) and the dependent variable (the closing price) using a linear equation.
Support Vector Regression (SVR): A regression algorithm that maps the data into a high-dimensional feature space and fits a function so that as many training points as possible lie within an epsilon-wide margin of it, penalizing points that fall outside.
Decision Tree Regression: A non-linear regression technique that uses a decision tree model to predict the stock price by recursively partitioning the data into subsets based on the feature values.
Random Forest Regression: An ensemble learning method that constructs multiple decision trees during training and outputs the average prediction of the individual trees for improved accuracy and robustness.
By implementing and comparing these regression algorithms, this project aims to provide insights into the effectiveness of different approaches in predicting stock prices. Stakeholders can utilize the predictions generated by these models to make informed decisions regarding investment strategies, whether to invest in the stock or divest from the company.
The script imports a dataset containing stock prices, and then cleans the data by removing unnecessary columns.
It visualizes the data using scatter plots and histograms to understand the distribution and relationships between variables.
It builds four different regression models: Simple Linear Regression, Support Vector Regression, Decision Tree Regression, and Random Forest Regression.
Each model is evaluated using the R-squared score to measure its accuracy in predicting stock prices (a worked example of the score appears after this list).
The script visualizes the accuracies of the four models using a bar chart, making it easy to compare their performance.
Finally, the script predicts the closing price of a stock for a specific date using the model with the highest accuracy.
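Since every model below is scored with r2_score, here is a minimal worked example of what that score computes; the toy values are illustrative only.
# R^2 = 1 - SS_res / SS_tot: SS_res is the sum of squared residuals,
# SS_tot the total sum of squares around the mean of the actual values.
import numpy as np
from sklearn.metrics import r2_score
y_true = np.array([3.0, -0.5, 2.0, 7.0])  # toy actual values
y_pred = np.array([2.5, 0.0, 2.0, 8.0])   # toy predictions
ss_res = np.sum((y_true - y_pred) ** 2)
ss_tot = np.sum((y_true - y_true.mean()) ** 2)
print(1 - ss_res / ss_tot)       # 0.9486...
print(r2_score(y_true, y_pred))  # same value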
The script imports a dataset containing stock prices, and then cleans the data by removing unnecessary columns.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score
import warnings
warnings.filterwarnings('ignore')
sns.set_style('darkgrid')
dataset = pd.read_csv('ICICI_BANK.csv')
dataset.shape
(5306, 15)
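The info table further down shows the Date column stored as object; if datetime handling were needed, a hedged variant parses it at load time (not used in the rest of this notebook, so it is left commented out):
# dataset = pd.read_csv('ICICI_BANK.csv', parse_dates=['Date'])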
from IPython.display import display, HTML
# Generate HTML code for the table
table_html = """
<div style="background-color: #f9f9f9; padding: 10px; border-radius: 5px;">
<h3>Dataset Columns</h3>
<table>
<tr>
<th>Column Name</th>
</tr>
"""
for column in dataset.columns:
    table_html += f"""
    <tr>
        <td>{column}</td>
    </tr>
    """
table_html += """
</table>
</div>
"""
# Display the HTML table
display(HTML(table_html))
Output (Dataset Columns): Date, Symbol, Series, Prev Close, Open, High, Low, Last, Close, VWAP, Volume, Turnover, Trades, Deliverable Volume, %Deliverble
from IPython.display import display, HTML
# Generate HTML code for the table
table_html = """
<div style="background-color: #f9f9f9; padding: 10px; border-radius: 5px;">
<h3>Dataset Info</h3>
<table>
<tr>
<th>Column Name</th>
<th>Non-Null Count</th>
<th>Dtype</th>
</tr>
"""
for column in dataset.columns:
    non_null_count = dataset[column].count()
    dtype = dataset[column].dtype
    table_html += f"""
    <tr>
        <td>{column}</td>
        <td>{non_null_count}</td>
        <td>{dtype}</td>
    </tr>
    """
table_html += """
</table>
</div>
"""
# Display the HTML table
display(HTML(table_html))
Output (Dataset Info):

| Column Name | Non-Null Count | Dtype |
|---|---|---|
| Date | 5306 | object |
| Symbol | 5306 | object |
| Series | 5306 | object |
| Prev Close | 5306 | float64 |
| Open | 5306 | float64 |
| High | 5306 | float64 |
| Low | 5306 | float64 |
| Last | 5306 | float64 |
| Close | 5306 | float64 |
| VWAP | 5306 | float64 |
| Volume | 5306 | int64 |
| Turnover | 5306 | float64 |
| Trades | 2456 | float64 |
| Deliverable Volume | 4789 | float64 |
| %Deliverble | 4789 | float64 |
from IPython.display import display, HTML
# Get the describe() output
desc = dataset.describe()
# Convert the describe() output to an HTML table
table_html = "<table>"
table_html += "<tr><th></th>"
for column in desc.columns:
    table_html += f"<th>{column}</th>"
table_html += "</tr>"
for index, row in desc.iterrows():
    table_html += "<tr>"
    table_html += f"<td>{index}</td>"
    for value in row:
        table_html += f"<td>{value:.2f}</td>"
    table_html += "</tr>"
table_html += "</table>"
# Display the HTML table
display(HTML(table_html))
Output (dataset.describe()):

| | Prev Close | Open | High | Low | Last | Close | VWAP | Volume | Turnover | Trades | Deliverable Volume | %Deliverble |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| count | 5306.00 | 5306.00 | 5306.00 | 5306.00 | 5306.00 | 5306.00 | 5306.00 | 5306.00 | 5306.00 | 2456.00 | 4789.00 | 4789.00 |
| mean | 550.90 | 551.56 | 560.56 | 541.53 | 551.05 | 551.00 | 551.13 | 8224630.71 | 375929920576296.62 | 138367.63 | 4183406.04 | 0.47 |
| std | 368.78 | 368.89 | 374.08 | 363.39 | 368.71 | 368.73 | 368.75 | 12185349.04 | 475813345498016.69 | 99008.73 | 6365381.73 | 0.13 |
| min | 67.40 | 67.00 | 70.45 | 66.00 | 67.00 | 67.40 | 68.52 | 7409.00 | 96172830000.00 | 2595.00 | 15015.00 | 0.10 |
| 25% | 267.56 | 267.40 | 271.91 | 263.62 | 267.40 | 267.61 | 267.58 | 961205.50 | 34594425000000.00 | 79312.25 | 699502.00 | 0.38 |
| 50% | 398.08 | 399.00 | 406.52 | 392.45 | 398.70 | 398.18 | 398.24 | 3486647.50 | 292301000000000.00 | 110101.00 | 1963117.00 | 0.48 |
| 75% | 873.56 | 877.00 | 888.77 | 859.80 | 874.60 | 873.56 | 873.51 | 11572021.25 | 499352750000000.00 | 162953.50 | 5948817.00 | 0.56 |
| max | 1794.10 | 1767.05 | 1798.15 | 1760.15 | 1793.00 | 1794.10 | 1783.46 | 286857658.00 | 14600000000000000.00 | 949891.00 | 232530747.00 | 0.98 |
# We can visualize the data before cleaning:
# display(dataset.head().style.hide(axis='index'))
# Delete unnecessary columns, keeping only Date, Open and Close.
# Equivalent explicit form:
# dataset.drop(["Symbol", "Series", "Prev Close", "High", "Low", "Last", "VWAP", "Volume", "Turnover", "Trades", "Deliverable Volume", "%Deliverble"],
#              axis=1, inplace=True)
dataset.drop(columns=dataset.columns.difference(['Date', 'Open', 'Close']), inplace=True)
# And again after cleaning:
# display(dataset.head().style.hide(axis='index'))
It visualizes the data using scatter plots and histograms to understand the distribution and relationships between variables.
import plotly.express as px
fig = px.scatter(dataset.head(100), x="Open", y="Close", title="Open v/s Close", color_discrete_sequence=['orange'])
fig.show()
import plotly.graph_objects as go
# Create scatter plot
fig1 = px.scatter(dataset.head(100), x="Open", y="Close", title="Open v/s Close", color_discrete_sequence=['orange'])
# Create histogram
fig2 = px.histogram(dataset, x="Close", nbins=50, title="Histogram of Close Prices", color_discrete_sequence=['orange'])
# Combine the two traces in one figure
fig = go.Figure()
fig.add_trace(fig1['data'][0])
fig.add_trace(fig2['data'][0])
# Update layout
fig.update_layout(title="Scatter Plot and Histogram", showlegend=False)
# Show the figure
fig.show()
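Note that go.Figure() with two add_trace calls overlays both traces on a single pair of axes. For true side-by-side panels, a hedged sketch using plotly's make_subplots:
from plotly.subplots import make_subplots
# One row, two columns: scatter on the left, histogram on the right
fig = make_subplots(rows=1, cols=2, subplot_titles=("Open v/s Close", "Histogram of Close Prices"))
fig.add_trace(fig1['data'][0], row=1, col=1)
fig.add_trace(fig2['data'][0], row=1, col=2)
fig.update_layout(title="Scatter Plot and Histogram", showlegend=False)
fig.show()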
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import RandomForestRegressor
It builds four different regression models: Simple Linear Regression, Support Vector Regression, Decision Tree Regression, and Random Forest Regression.
X = dataset['Open'].values
y = dataset['Close'].values
X_train, X_test, y_train, y_test = train_test_split(X, y, train_size=0.7, test_size=0.3)
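One caveat: train_test_split shuffles rows by default, so test samples can precede training samples in time. A hedged alternative for a stricter chronological evaluation is shown commented out below; it is not used here, so the printed results that follow are unaffected.
# X_train, X_test, y_train, y_test = train_test_split(
#     X, y, train_size=0.7, test_size=0.3, shuffle=False)
# Passing a fixed random_state (e.g. random_state=42) would also make the
# shuffled split reproducible across runs.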
model1 = LinearRegression()
build1 = model1.fit(X_train.reshape(-1, 1), y_train)
predict1 = model1.predict(X_test.reshape(-1, 1))
print("Co-efficient: ", model1.coef_)
print("\nIntercept: ", model1.intercept_)
Co-efficient: [0.99860402]
Intercept: 0.24768269983928803
from IPython.display import HTML
# df1 is displayed below but its creation was missing from this excerpt;
# reconstructed here to mirror the pattern used for df2-df4
df1 = pd.DataFrame(list(zip(y_test, predict1)), columns=["Actual Values", "Predicted Values"])
html_table = df1.head(10).to_html(index=False, justify='center', classes='table table-striped table-hover table-bordered')
styled_table = f'<div style="text-align: center;"><style>table {{border-collapse: collapse; width: 50%;}} th, td {{border: 1px solid #dddddd; text-align: left; padding: 8px;}} th {{background-color: #f2f2f2;}}</style>{html_table}</div>'
display(HTML(styled_table))
| Actual Values | Predicted Values |
|---|---|
| 136.80 | 137.635962 |
| 330.20 | 339.076123 |
| 879.15 | 872.417691 |
| 262.50 | 263.598544 |
| 580.70 | 569.507672 |
| 1040.20 | 1024.272582 |
| 360.70 | 359.570035 |
| 852.15 | 856.422442 |
| 829.05 | 869.418582 |
| 187.60 | 184.622005 |
import plotly.express as px
import plotly.io as pio
# Create the bar plot
fig = px.bar(df1.head(50), title='Simple Linear Regression', barmode='group', color_discrete_sequence=px.colors.qualitative.Plotly)
# Customize the layout
fig.update_layout(
xaxis_title='Index',
yaxis_title='Values',
legend_title='Data',
width=1200,
height=600,
xaxis_tickangle=-45, # Rotate x-axis labels for better readability
showlegend=True, # Show legend
font=dict(size=12), # Set font size
plot_bgcolor='rgba(0,0,0,0)', # Set plot background color
paper_bgcolor='rgba(0,0,0,0)', # Set paper background color
bargap=0.1, # Set gap between bars
xaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)'), # Show x-axis gridlines
yaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)') # Show y-axis gridlines
)
# Add data labels to the bars
fig.update_traces(texttemplate='%{y}', textposition='outside')
# Display the plot
fig.show()
# Save the plot as a PNG file
# pio.write_image(fig, 'bar_plot.png')
accuracy1 = r2_score(y_test, predict1)
print("Accuracy of Simple Linear Regression:", accuracy1)
Accuracy of Simple Linear Regression: 0.99849322863788
model2 = SVR(kernel="rbf", gamma=0.01, C=100)
build2 = model2.fit(X_train.reshape(-1, 1), y_train)
predict2 = model2.predict(X_test.reshape(-1, 1))
df2 = pd.DataFrame(list(zip(y_test, predict2)), columns=["Actual Values", "Predicted Values"])
# Style the first ten rows of df2 for display, hiding the row-index header
styled_df_table = df2.head(10).style.set_table_styles([
    {'selector': 'th.row_heading', 'props': 'display: none;'}
]).set_properties(**{'text-align': 'center'})
styled_df_table
| | Actual Values | Predicted Values |
|---|---|---|
| 0 | 1023.250000 | 1016.899042 |
| 1 | 392.050000 | 392.051720 |
| 2 | 550.900000 | 533.321320 |
| 3 | 301.100000 | 302.149611 |
| 4 | 362.300000 | 352.221443 |
| 5 | 212.900000 | 214.536149 |
| 6 | 264.000000 | 263.939771 |
| 7 | 631.750000 | 608.350294 |
| 8 | 967.900000 | 966.007576 |
| 9 | 574.700000 | 545.773854 |
import plotly.express as px
# Create the bar plot
fig = px.bar(df2.head(50), title='Support Vector Regression', barmode='group', color_discrete_sequence=px.colors.qualitative.Plotly)
# Customize the layout
fig.update_layout(
xaxis_title='Index',
yaxis_title='Values',
legend_title='Data',
width=1200,
height=600,
xaxis_tickangle=-45, # Rotate x-axis labels for better readability
showlegend=True, # Show legend
font=dict(size=12), # Set font size
plot_bgcolor='rgba(0,0,0,0)', # Set plot background color
paper_bgcolor='rgba(0,0,0,0)', # Set paper background color
bargap=0.1, # Set gap between bars
xaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)'), # Show x-axis gridlines
yaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)') # Show y-axis gridlines
)
# Add data labels to the bars
fig.update_traces(texttemplate='%{y}', textposition='outside')
# Display the plot
fig.show()
accuracy2 = r2_score(y_test, predict2)
print("Accuracy of Support Vector Regression:", accuracy2)
Accuracy of Support Vector Regression: 0.9799364177634605
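SVR with an RBF kernel is sensitive to the scale of its inputs. A hedged variant standardizes the feature inside a Pipeline; C=100 and gamma="scale" are illustrative choices here, not tuned values from this project.
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
# Standardize Open before fitting the RBF-kernel SVR
scaled_svr = make_pipeline(StandardScaler(), SVR(kernel="rbf", C=100, gamma="scale"))
scaled_svr.fit(X_train.reshape(-1, 1), y_train)
print("R2 with scaling:", scaled_svr.score(X_test.reshape(-1, 1), y_test))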
model3 = DecisionTreeRegressor()
build3 = model3.fit(X_train.reshape(-1, 1), y_train)
predict3 = model3.predict(X_test.reshape(-1, 1))
df3 = pd.DataFrame(list(zip(y_test, predict3)), columns=["Actual Values", "Predicted Values"])
# Style the first ten rows of df3 for display, hiding the row-index header
styled_df_table = df3.head(10).style.set_table_styles([
    {'selector': 'th.row_heading', 'props': 'display: none;'}
]).set_properties(**{'text-align': 'center'})
styled_df_table
| | Actual Values | Predicted Values |
|---|---|---|
| 0 | 1023.250000 | 1022.950000 |
| 1 | 392.050000 | 383.900000 |
| 2 | 550.900000 | 528.700000 |
| 3 | 301.100000 | 303.883333 |
| 4 | 362.300000 | 348.425000 |
| 5 | 212.900000 | 211.000000 |
| 6 | 264.000000 | 264.450000 |
| 7 | 631.750000 | 601.550000 |
| 8 | 967.900000 | 966.050000 |
| 9 | 574.700000 | 549.300000 |
import plotly.express as px
# Create the bar plot
fig = px.bar(df3.head(50), title='Decision Tree Regression', barmode='group', color_discrete_sequence=px.colors.qualitative.Plotly)
# Customize the layout
fig.update_layout(
xaxis_title='Index',
yaxis_title='Values',
legend_title='Data',
width=1200,
height=600,
xaxis_tickangle=-45, # Rotate x-axis labels for better readability
showlegend=True, # Show legend
font=dict(size=12), # Set font size
plot_bgcolor='rgba(0,0,0,0)', # Set plot background color
paper_bgcolor='rgba(0,0,0,0)', # Set paper background color
bargap=0.1, # Set gap between bars
xaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)'), # Show x-axis gridlines
yaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)') # Show y-axis gridlines
)
# Add data labels to the bars
fig.update_traces(texttemplate='%{y}', textposition='outside')
# Display the plot
fig.show()
accuracy3 = r2_score(y_test, predict3)
print("Accuracy of Decision Tree Regression:", accuracy3)
Accuracy of Decision Tree Regression: 0.9973466715973613
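An unconstrained DecisionTreeRegressor can effectively memorize the training data. A hedged check caps the tree depth and re-scores on the same split; max_depth=8 is an illustrative value, not a tuned one.
# Depth-limited tree as an overfitting check
pruned_tree = DecisionTreeRegressor(max_depth=8, random_state=0)
pruned_tree.fit(X_train.reshape(-1, 1), y_train)
print("R2 with max_depth=8:", pruned_tree.score(X_test.reshape(-1, 1), y_test))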
model4 = RandomForestRegressor(n_estimators=100)
build4 = model4.fit(X_train.reshape(-1, 1), y_train)
predict4 = model4.predict(X_test.reshape(-1, 1))
df4 = pd.DataFrame(list(zip(y_test, predict4)), columns=["Actual Values", "Predicted Values"])
# Style the first ten rows of df4 for display, hiding the row-index header
styled_df_table = df4.head(10).style.set_table_styles([
    {'selector': 'th.row_heading', 'props': 'display: none;'}
]).set_properties(**{'text-align': 'center'})
styled_df_table
| | Actual Values | Predicted Values |
|---|---|---|
| 0 | 1023.250000 | 1019.395500 |
| 1 | 392.050000 | 392.366250 |
| 2 | 550.900000 | 530.443500 |
| 3 | 301.100000 | 303.901994 |
| 4 | 362.300000 | 348.924955 |
| 5 | 212.900000 | 211.364093 |
| 6 | 264.000000 | 264.753700 |
| 7 | 631.750000 | 603.482300 |
| 8 | 967.900000 | 968.403500 |
| 9 | 574.700000 | 550.432000 |
import plotly.express as px
# Create the bar plot
fig = px.bar(df4.head(50), title='Random Forest Regression', barmode='group', color_discrete_sequence=px.colors.qualitative.Plotly)
# Customize the layout
fig.update_layout(
xaxis_title='Index',
yaxis_title='Values',
legend_title='Data',
width=1200,
height=600,
xaxis_tickangle=-45, # Rotate x-axis labels for better readability
showlegend=True, # Show legend
font=dict(size=12), # Set font size
plot_bgcolor='rgba(0,0,0,0)', # Set plot background color
paper_bgcolor='rgba(0,0,0,0)', # Set paper background color
bargap=0.1, # Set gap between bars
xaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)'), # Show x-axis gridlines
yaxis=dict(showgrid=True, gridcolor='rgba(0,0,0,0.1)') # Show y-axis gridlines
)
# Add data labels to the bars
fig.update_traces(texttemplate='%{y}', textposition='outside')
# Display the plot
fig.show()
accuracy4 = r2_score(y_test, predict4)
print("Accuracy of Random Forest Regression:", accuracy4)
Accuracy of Random Forest Regression: 0.9979437067080489
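A single random split can flatter one model over another. A hedged cross-validated comparison of the same four models (scores will differ somewhat from the single-split numbers above):
from sklearn.model_selection import cross_val_score
# 5-fold cross-validation on the full Open/Close data, scored with R^2
for name, m in [("SLR", LinearRegression()),
                ("SVR", SVR(kernel="rbf", gamma=0.01, C=100)),
                ("DTR", DecisionTreeRegressor()),
                ("RFR", RandomForestRegressor(n_estimators=100))]:
    scores = cross_val_score(m, X.reshape(-1, 1), y, cv=5, scoring="r2")
    print(f"{name}: mean R2 = {scores.mean():.4f} (std {scores.std():.4f})")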
The script visualizes the accuracies of the four models using a bar chart, making it easy to compare their performance.
dict1 = {
"Model": ["Simple Linear Regression", "Support Vector Regression", "Decision Tree Regression", "Random Forest Regression"],
"Accuracy": np.array([accuracy1, accuracy2, accuracy3, accuracy4])
}
df = pd.DataFrame(dict1)
styled_df = df.style.set_table_styles([{'selector': 'tr:hover','props': [('background-color', 'yellow')]}])
styled_df
| | Model | Accuracy |
|---|---|---|
| 0 | Simple Linear Regression | 0.998493 |
| 1 | Support Vector Regression | 0.979936 |
| 2 | Decision Tree Regression | 0.997347 |
| 3 | Random Forest Regression | 0.997944 |
models = ['SLR', 'SVR', 'DTR', 'RFR']
acc = [accuracy1, accuracy2, accuracy3, accuracy4]
plt.figure(figsize=(20, 10))
plt.title('Comparison of Accuracies of models')
plt.yticks(np.linspace(0, 1, 21))
plt.ylabel("Accuracy")
plt.xlabel("Models")
# Create a DataFrame from the models and acc arrays
df_acc = pd.DataFrame({'Model': models, 'Accuracy': acc})
plot = sns.barplot(x='Model', y='Accuracy', data=df_acc, palette='viridis')
for p in plot.patches:
    plot.annotate(format(p.get_height(), '.2f'),
                  (p.get_x() + p.get_width() / 2., p.get_height()),
                  ha='center', va='center',
                  xytext=(0, 9),
                  textcoords='offset points')
plt.show()
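The next cells reference future_stock_value, which is never constructed in this excerpt. A minimal reconstruction consistent with the output shown below (the Date and Open values are taken from that output; the Predicted column is filled in by the model-selection cell further down):
# Hypothetical reconstruction of the missing DataFrame: one future trading
# date with a known opening price
future_stock_value = pd.DataFrame({'Date': ['11-May-22'], 'Open': [718.0]})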
html_table = future_stock_value.to_html(index=False, justify='center', classes='table table-striped table-hover table-bordered')
styled_table = f'<style>table {{border-collapse: collapse; width: 50%;}} th, td {{border: 1px solid #dddddd; text-align: left; padding: 8px;}} th {{background-color: #f2f2f2;}}</style>{html_table}'
display(HTML(styled_table))
| Date | Open | Predicted |
|---|---|---|
| 11-May-22 | 718.0 | 717.130664 |
Finally, the script predicts the closing price of a stock for a specific date using the model with the highest accuracy.
models = np.array(df['Model'])
accuracy = np.array(df['Accuracy'])
# Find the model with the highest R-squared score
highest_accuracy = 0.0
best_model = ""
for i in range(len(accuracy)):
    if accuracy[i] >= highest_accuracy:
        highest_accuracy = accuracy[i]
        best_model = models[i]
# Predict the closing price with whichever model scored best
if best_model == models[0]:
    future_stock_value['Predicted'] = model1.predict(future_stock_value.Open.values.reshape(-1, 1))
elif best_model == models[1]:
    future_stock_value['Predicted'] = model2.predict(future_stock_value.Open.values.reshape(-1, 1))
elif best_model == models[2]:
    future_stock_value['Predicted'] = model3.predict(future_stock_value.Open.values.reshape(-1, 1))
elif best_model == models[3]:
    future_stock_value['Predicted'] = model4.predict(future_stock_value.Open.values.reshape(-1, 1))
print(future_stock_value.to_string(index=False))
Date Open Predicted
11-May-22 718.0 717.24537
fig, ax = plt.subplots()
ax.axis('off')
props = dict(boxstyle='round', facecolor='lightblue', alpha=0.5)
ax.text(0.5, 0.5, 'THANK YOU', va='center', ha='center', fontsize=30, fontweight='bold', bbox=props)
plt.show()