The Blob Storage Interface class
azure_helper.utils.blob_storage_interface
BlobStorageInterface
Source code in azure_helper/utils/blob_storage_interface.py
__init__(storage_acct_name, storage_acct_key)
Class responsible for interacting with an existing Azure Storage Account.
It uses a connection string to connect to the Storage Account.
Information
The key of this storage account can be retrieved with the azure-cli.
This class is responsible for:
- Creating a container in the storage account.
- Uploading a dataframe (as a csv for now) to a blob in one of the containers of the storage account.
- Downloading a csv from a blob in one of the containers of the storage account and rendering it as a pandas dataframe.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
storage_acct_name | str | The name of the storage account to which you want to connect. | required |
storage_acct_key | str | The account key of the storage account. | required |
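The connection string used by the class typically follows Azure's standard key-based format. The following is a minimal sketch, assuming the string is assembled from the two constructor arguments; the helper name `build_connection_string` and the default endpoint suffix are illustrative assumptions, not taken from the source.

```python
def build_connection_string(
    storage_acct_name: str,
    storage_acct_key: str,
    endpoint_suffix: str = "core.windows.net",
) -> str:
    """Assemble a key-based Azure Storage connection string.

    Hypothetical helper: the real class may build or obtain its
    connection string differently.
    """
    return (
        "DefaultEndpointsProtocol=https;"
        f"AccountName={storage_acct_name};"
        f"AccountKey={storage_acct_key};"
        f"EndpointSuffix={endpoint_suffix}"
    )


# Placeholder credentials, for illustration only.
conn_str = build_connection_string("mystorageacct", "<account-key>")
```

A string of this shape can be passed to `BlobServiceClient.from_connection_string` from the `azure-storage-blob` package to obtain a client for the account.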
create_container(container_name)
Create a container inside the storage account.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
container_name | str | The name of the container you want to create. This name can only contain alphanumeric characters and dashes '-'. | required |
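The naming constraint can be checked locally before calling `create_container`. Azure's own rules are somewhat stricter than "alphanumeric characters and dashes": names must be 3 to 63 characters long, use lowercase letters only, start and end with a letter or digit, and contain no consecutive dashes. The sketch below encodes those rules; `is_valid_container_name` is a hypothetical helper, not part of the class.

```python
import re

# Azure container naming rules: lowercase letters, digits and dashes only,
# starting and ending with a letter or digit, with every dash surrounded
# by letters or digits (so no two dashes in a row).
_CONTAINER_NAME = re.compile(r"[a-z0-9]+(?:-[a-z0-9]+)*")


def is_valid_container_name(container_name: str) -> bool:
    """Return True if the name satisfies Azure's container naming rules."""
    return (
        3 <= len(container_name) <= 63
        and _CONTAINER_NAME.fullmatch(container_name) is not None
    )
```

Validating up front gives a clearer error than waiting for the service to reject the request.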
upload_df_to_blob(dataframe, container_name, blob_path)
Upload a pandas dataframe as a csv file inside a blob.
For example, the dataframes x_train and y_train can be uploaded as x_train.csv and y_train.csv.
Attention
As of now, there is no data versioning: if the blob_path already exists, it will be overwritten with the new data.
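The overwrite behaviour can be illustrated without an Azure account. This is a minimal sketch, not the class's actual implementation: `InMemoryContainer` is a hypothetical stand-in for a container client, and dropping the index with `index=False` is an assumption about how the csv is written.

```python
class InMemoryContainer:
    """Hypothetical stand-in for a blob container, for illustration only."""

    def __init__(self) -> None:
        self.blobs = {}  # maps blob name -> raw bytes

    def upload_blob(self, name: str, data: bytes, overwrite: bool = False) -> None:
        # Refuse to replace an existing blob unless overwrite is requested.
        if name in self.blobs and not overwrite:
            raise FileExistsError(name)
        self.blobs[name] = data


def upload_df_to_blob(container, dataframe, blob_path: str) -> None:
    """Serialize the dataframe to csv and store it at blob_path.

    Any blob already stored at blob_path is replaced (no versioning).
    """
    # Assumption: the index is not written to the csv.
    csv_bytes = dataframe.to_csv(index=False).encode("utf-8")
    container.upload_blob(name=blob_path, data=csv_bytes, overwrite=True)
```

With pandas installed, `upload_df_to_blob(container, x_train, "x_train.csv")` stores the serialized frame, and a second call with the same path silently replaces it. `ContainerClient.upload_blob` in `azure-storage-blob` accepts the same `name`, `data` and `overwrite` arguments, so the function should also work against a real container client.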
Parameters:
Name | Type | Description | Default |
---|---|---|---|
dataframe | pd.DataFrame | The dataframe you want to upload. | required |
container_name | str | The name of the container on which you want to upload the dataframe. | required |
blob_path | str | The path to the csv | required |
download_blob_to_df(container_name, blob_path)
Download a csv file at the given blob_path location and render it as a pandas dataframe.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
container_name | str | The name of the container. | required |
blob_path | str | The path to the csv | required |
Returns:
Type | Description |
---|---|
pd.DataFrame | The csv at blob_path, rendered as a pandas dataframe. |
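The download path can be sketched in two steps: fetch the blob's raw bytes, then parse them as csv. This is a hedged illustration rather than the class's actual code. The function `read_blob_bytes` is a hypothetical helper, and `container` is assumed to expose `download_blob(...).readall()`, as `ContainerClient` from `azure-storage-blob` does.

```python
import io


def read_blob_bytes(container, blob_path: str) -> bytes:
    """Return the raw content of the blob stored at blob_path."""
    return container.download_blob(blob_path).readall()


def download_blob_to_df(container, blob_path: str):
    """Fetch the csv stored at blob_path and parse it into a pandas dataframe."""
    # Imported lazily so the byte-fetching helper works without pandas.
    import pandas as pd

    return pd.read_csv(io.BytesIO(read_blob_bytes(container, blob_path)))
```

Wrapping the bytes in `io.BytesIO` lets `pd.read_csv` parse them directly from memory, without writing a temporary file.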