pipeline.src.flows.isscaap_codes

Functions

join_values_by_column(→ list)

Takes a pandas.DataFrame and returns a list whose length is the same as the

extract_isscaap_codes_pdf(→ io.BytesIO)

parse_pdf(isscaap_codes_pdf)

transform_isscaap_codes(→ pandas.DataFrame)

export_isscaap_codes(→ None)

isscaap_codes_flow(csv_filepath)

Module Contents

pipeline.src.flows.isscaap_codes.join_values_by_column(df: pandas.DataFrame, join_string: str = ' ') list[source]

Takes a pandas.DataFrame and returns a list whose length is the same as the number of columns in the input DataFrame and whose values are made of the values of the corresponding columns joined in a single string.

Parameters:
  • df (pd.DataFrame) – a pd.DataFrame with only string values.

  • join_string (str, optional) – The string used to join values. Defaults to “ “.

Returns:

list

pipeline.src.flows.isscaap_codes.extract_isscaap_codes_pdf() io.BytesIO[source]
pipeline.src.flows.isscaap_codes.parse_pdf(isscaap_codes_pdf: io.BytesIO)[source]
pipeline.src.flows.isscaap_codes.transform_isscaap_codes(isscaap_codes: pandas.DataFrame) pandas.DataFrame[source]
pipeline.src.flows.isscaap_codes.export_isscaap_codes(isscaap_codes: pandas.DataFrame, csv_filepath: str) None[source]
pipeline.src.flows.isscaap_codes.isscaap_codes_flow(csv_filepath: str)[source]