Streamlining Data Wrangling with the CP-Seeker Post-Processing App

CP-Seeker Post-Processing App
Author

Adam Cseresznye

Published

November 13, 2023

Photo by National Cancer Institute on UnSplash

Welcome to the CP-Seeker Post-Processing App! This application is designed to streamline the organization of CP-Seeker outputs, simplifying additional quantitation steps required for further pipeline processes. CP-Seeker, a novel custom-built software developed by Ronan Cariou and team, is based on the R-programming language. Its purpose is to seamlessly conduct peak integration and quantification of chlorinated paraffins from high-resolution LC-MS data.

Note

You can access the project’s app through its Streamlit website.

Introduction

The environmental analysis of chlorinated paraffins (CPs) poses significant challenges due to their intricate composition, varying congener patterns in the environment, and the absence of suitable reference standards. The presence of numerous isomers, enantiomers, and diastereomers within CP mixtures represents a considerable hurdle.

Introducing CP-Seeker, a custom-built, user-friendly, and automated software dedicated to peak integration of CPs and related chemical families. Developed at LABERCA, a joint research unit of ONIRIS and INRAE in Nantes, France. CP-Seeker is an open-source R software built using the Shiny framework.

The CP-Seeker post-processing app automates the data-wrangling once the data, including confidence levels and peak areas, is obtained from CP-Seeker. Manual processes often involve repetitive copy-pasting of data, introducing potential errors and tediousness. As depicted in Figure 1, CP-Seeker’s output poses challenges as injected samples are spread across separate excel sheets. Moreover, each sample encompasses two dataframes on the same tab: - one containing confidence scores assigned by CP-Seeker to detected congeners, - and the other holding the actual peak areas.

Figure 1: Illustrative Application of CP-Seeker Post-Processing App

The CP-Seeker post-processing app swiftly imports all datasets from various tabs and restructures the dataframes, akin to the pandas melt function. This transformation organizes congeners neatly, stacking them one beneath the other, rather than grouping them separately by carbon number and chlorine content. Consequently, all samples are presented side by side, eliminating the need for different tabs. Additionally, the app offers the ability to filter congeners and peak areas based on confidence intervals. In other words, congeners assigned lower confidence scores can be automatically excluded from further processing. Once the automated data wrangling is complete, the data is ready for further analysis using software/workflow of your choice.

We believe the CP-Seeker Post-Processing App significantly cuts down data preparation time for subsequent analysis stages, thereby enhancing productivity and removing potential errors associated with manual data copying and pasting.

How to use the App

  1. Upload your CP-Seeker output files here.
  2. Optionally adjust the confidence level, which is set to 80% by default.
  3. Download the filtered and merged dataset.

Get in touch

Did the app help with your research? Any ideas for making it better? Get in touch! I would love to hear from you.