brite_etl.core.operations

Operations in general are functions that are done on a single frame, but no specific frame. Specific frame functions belong in the frame class themselves.

Functions

hash_cols(df, cols=None)[source]

Hash columns with MD5

Returns a dataframe containing 1 MD5 hash of the selected contents of the passed dataframe

Parameters:
  • df (DataFrame) – The dataframe to hash
  • cols (list of str, optional) – Which columns to hash (the default is None, which hashes all columns)
Returns:

DataFrame – Dataframe with hashes for selected columns