I am new to Python, so I need a little help here. I have a dataframe with a URL column; each link lets me download a CSV. My aim is to write a loop (or whatever works) so that one command downloads and reads the CSV for each row and creates a dataframe from it. Any help would be appreciated. I have attached part of the dataframe below. If the links don't work (they probably won't), you can replace them with a link from 'https://finance.yahoo.com/quote/GOOG/history?p=GOOG' (any other company works too): navigate to the CSV download there and use that link.
Dataframe:
Symbol Link
YI https://query1.finance.yahoo.com/v7/finance/download/YI?period1=1383609600&period2=1541376000&interval=1d&events=history&crumb=PMHbxK/sU6E
PIH https://query1.finance.yahoo.com/v7/finance/download/PIH?period1=1383609600&period2=1541376000&interval=1d&events=history&crumb=PMHbxK/sU6E
TURN https://query1.finance.yahoo.com/v7/finance/download/TURN?period1=1383609600&period2=1541376000&interval=1d&events=history&crumb=PMHbxK/sU6E
FLWS https://query1.finance.yahoo.com/v7/finance/download/FLWS?period1=1383609600&period2=1541376000&interval=1d&events=history&crumb=PMHbxK/sU6E
Thanks again.
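Since the question asks for a single loop over the dataframe, here is a minimal sketch of one. It assumes the columns are named Symbol and Link as shown above; the read_links helper name is my own. pd.read_csv accepts URLs, file paths, and file-like objects alike, so the demo below feeds it in-memory CSVs (made-up data) in place of the real download links:

```python
import io

import pandas as pd

def read_links(df):
    """Return a dict mapping each Symbol to the DataFrame read from its Link."""
    frames = {}
    for symbol, link in zip(df["Symbol"], df["Link"]):
        # pd.read_csv handles remote URLs and file-like objects the same way
        frames[symbol] = pd.read_csv(link)
    return frames

# Demo with in-memory CSVs standing in for the download links:
links = pd.DataFrame({
    "Symbol": ["YI", "PIH"],
    "Link": [io.StringIO("Date,Close\n2018-10-05,10.0\n"),
             io.StringIO("Date,Close\n2018-10-05,7.5\n")],
})
frames = read_links(links)
print(frames["YI"].shape)  # (1, 2)
```

With your real dataframe you would pass it in unchanged and get one DataFrame per row, keyed by ticker symbol.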
Method #3: Using the csv module: you can fetch the CSV yourself, parse it with the csv module, and build a DataFrame from the parsed rows. Alternatively, pandas' read_csv accepts a URL directly, so you can pass the link straight to pd.read_csv to read it into a DataFrame.
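A minimal sketch of the csv-module route, using made-up sample text in place of a downloaded response body. Note that csv.DictReader leaves every field as a string, unlike read_csv's type inference:

```python
import csv
import io

import pandas as pd

# Made-up sample standing in for a downloaded CSV body
csv_text = "Date,Close\n2018-10-05,10.25\n2018-10-08,10.40\n"

# csv.DictReader parses each row into a dict keyed by the header row
rows = list(csv.DictReader(io.StringIO(csv_text)))
df = pd.DataFrame(rows)

print(list(df.columns))  # ['Date', 'Close']
```

If you need numeric dtypes afterwards, convert explicitly (e.g. with pd.to_numeric), or skip the csv module entirely and let read_csv infer them.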
There are multiple ways to get CSV data from URLs. In your example, Yahoo Finance, you can copy the Historical Data download link and pass it to pandas:
...
import pandas as pd

HISTORICAL_URL = "https://query1.finance.yahoo.com/v7/finance/download/GOOG?period1=1582781719&period2=1614404119&interval=1d&events=history&includeAdjustedClose=true"
df = pd.read_csv(HISTORICAL_URL)
A general pattern involves tools like requests or httpx to make a GET or POST request, then passing the response contents through io:
import io

import pandas as pd
import requests

url = 'https://query1.finance.yahoo.com/v7/finance/download/GOOG'
params = {
    'period1': 1538761929,
    'period2': 1541443929,
    'interval': '1d',
    'events': 'history',
    'crumb': 'v4z6ZpmoP98',
}

r = requests.post(url, data=params)
if r.ok:
    # Decode the raw bytes, then wrap them in a file-like object for pandas
    data = r.content.decode('utf8')
    df = pd.read_csv(io.StringIO(data))
To get the params, I just followed the link and copied everything after the '?'. Check that they match ;)
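Instead of copying the query string by hand, you can also split it programmatically with the standard library's urllib.parse, which keeps the crumb and the timestamps in sync with the original link. A sketch using one of the links from the question:

```python
from urllib.parse import parse_qs, urlparse

# One of the links from the question's dataframe
link = ("https://query1.finance.yahoo.com/v7/finance/download/YI"
        "?period1=1383609600&period2=1541376000&interval=1d"
        "&events=history&crumb=PMHbxK/sU6E")

parsed = urlparse(link)
# parse_qs returns a list of values per key; each key appears once here
params = {key: values[0] for key, values in parse_qs(parsed.query).items()}
base_url = f"{parsed.scheme}://{parsed.netloc}{parsed.path}"

print(base_url)         # https://query1.finance.yahoo.com/v7/finance/download/YI
print(params["crumb"])  # PMHbxK/sU6E
```

base_url and params can then be fed straight into the requests call above.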
Update:
If you can see the raw CSV contents directly at the URL, just pass the URL to pd.read_csv.
Example data directly from url:
data_url = 'https://raw.githubusercontent.com/pandas-dev/pandas/master/pandas/tests/data/iris.csv'
df = pd.read_csv(data_url)