MSAnalysis
- class MSAnalysis.MS_Analysis(MS_FilePath=None, MS_FilePaths=[], MS_FileType=None, Annotation_FilePath=None, logger=None, ingui=True, longtable=False, longtable_annot=False)
A class to set up the right configurations before performing data analysis
- Parameters:
MS_FilePath (str) – File path of the input MRM transition name file
MS_FilePaths (str) – File paths of the input MRM transition name file (multiple)
MS_FileType (str) – File type of the input MRM transition name file
Annotation_FilePath (str) – The file path to the MS Template Creator annotation file if provided
Area_df (pandas DataFrame) – A data frame of sample as rows and transition names as columns with area as values
logger (object) – logger object created by start_logger in MSOrganiser
ingui (bool) – if True, print analysis status to screen
longtable (bool) – if True, prepare a dataframe to store results in long table
longtable_annot (bool) – if True, prepare a dataframe to store annotation details in long table
- get_Long_Table(allow_multiple_istd=False, concatenation_type=None)
Function to get the long table of the extracted or calculated MRM transition name data.
- Parameters:
allow_multiple_istd (bool) – if True, allow Transition_Annot data by to have mulitple internal standards (in development)
concatenation_type (str) – “rows or columns or None” to indicate if how Sample_Annot should be cleaned before merging with the Long_Table.
- Returns:
Output_df (pandas DataFrame) – A long data frame of with column name Sample_Name, Transition_Name and other relevant data
- get_from_Input_Data(column_name, outputdata=True, allow_multiple_istd=False)
Function to get a specific column from the input MRM transition name data.
- Parameters:
column_name (str) – The name of the column given in the Output_Options.
outputdata (bool) – if True, return the results as a pandas dataframe. Else, nothing is returned
allow_multiple_istd (bool) – if True, allow normalisation of peak area by mulitple internal standards which leads to an expansion of the Output_df
- Returns:
Output_df (pandas DataFrame) – A data frame of sample as rows and transition names as columns with values from the chosen column name
- get_Normalised_Area(analysis_name, outputdata=True, allow_multiple_istd=False, using_multiple_input_files=False, concatenation_type='rows')
Function to calculate the normalised area from the input MRM transition name data and MS Template Creator annotation file.
- Parameters:
analysis_name (str) – The name of the column given in the Output_Options. Should be “normArea by ISTD”
outputdata (bool) – if True, return the results as a pandas dataframe. Else, nothing is returned
allow_multiple_istd (bool) – if True, allow normalisation of peak area by mulitple internal standards (in development)
using_multiple_input_files (bool) – if True, the Area df will be constructed from multiple input files, denoted in MS_FilePaths (in development)
concatenation_type (str) – “rows or columns” to indicate if the Area_df is to be concatenated by row wise or column wise respectively
- Returns:
(list) –
list containing:
norm_Area_df (pandas DataFrame): A data frame of sample as rows and transition names as columns with the normalised area as values
ISTD_Area (pandas DataFrame): A data frame of sample as rows and transition names as columns with the ISTD area as values. Output as excel only at testing mode
ISTD_map_df (pandas DataFrame): A data frame of showing the transition names annotation
ISTD_Report (pandas DataFrame): A data frame of with transition names, its corresponding ISTD as columns. This will be converted to a pdf file page
- get_Analyte_Concentration(analysis_name, outputdata=True, allow_multiple_istd=False, using_multiple_input_files=False, concatenation_type='rows')
Function to calculate the transition names concentration from the input MRM transition name data and MS Template Creator annotation file.
- Parameters:
analysis_name (str) – The name of the column given in the Output_Options. Should be “normConc by ISTD”
outputdata (bool) – if True, return the results as a pandas dataframe. Else, nothing is returned
allow_multiple_istd (bool) – if True, allow normalisation of peak area by mulitple internal standards
using_multiple_input_files (bool) – if True, the Area df will be constructed from multiple input files, denoted in MS_FilePaths (in development)
concatenation_type (str) – “rows or columns” to indicate if the Area_df is to be concatenated by row wise or column wise respectively
- Returns:
(list) –
list containing:
norm_Conc_df (pandas DataFrame): A data frame of sample as rows and transition names as columns with the transition name concentration as values
ISTD_Conc_df (pandas DataFrame): A data frame of sample as rows and transition names as columns with the ISTD concentration as values
ISTD_Samp_Ratio_df (pandas DataFrame): A data frame of with transition names, its corresponding ISTD and ISTD to Sample ratio as columns
Sample_Annot_df (pandas DataFrame): A data frame showing the samples annotation