Source author record

Hardik I. Parikh

Hardik I. Parikh appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Genomics Other Quantitative Biology

Catalog footprint

What is connected

3works

2topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2015arXiv

Massive Multi-Omics Microbiome Database (M3DB): A Scalable Data Warehouse and Analytics Platform for Microbiome Datasets

Massive Multi-Omics Microbiome Database (M3DB) is a data warehousing and analytics solution designed to handle diverse, complex, and unprecedented volumes of sequence and taxonomic classification data obtained in a typical microbiome project using NGS technologies. M3DB is a platform developed on Apache Hadoop, Apache Hive and PostgreSQL technologies. It enables users to store, analyze and manage high volumes of data, and also provides them the ability to query it in a fast and efficient manner. The M3DB framework includes command line tools to process and store microbiome data, along with an easy-to-use web-interface for uploading, querying, analyzing and visualizing the data and/or results. Availability: The source-code of M3DB is freely available for download at http://www.github.com/nisheth/M3DB.

preprint2015arXiv

MEEPTOOLS: A maximum expected error based FASTQ read filtering and trimming toolkit

Next generation sequencing technology rapidly produces massive volume of data and quality control of this sequencing data is essential to any genomic analysis. Here we present MEEPTOOLS, which is a collection of open-source tools based on maximum expected error as a percentage of read length (MEEP score) to filter, trim, truncate and assess next generation DNA sequencing data in FASTQ file format. MEEPTOOLS provides a non-traditional approach towards read filtering/trimming based on maximum error probabilities of the bases in the read on a non-logarithmic scale. This method simultaneously retains more reliable bases and removes more unreliable bases than the traditional quality filtering strategies.

preprint2015arXiv

MeFiT: Merging and Filtering Tool for Illumina Paired-End Reads for 16S rRNA Amplicon Sequencing

Recent advances in next-generation sequencing have revolutionized genomic research. 16S rRNA amplicon sequencing using paired-end sequencing on the MiSeq platform from Illumina, Inc., is being used to characterize the composition and dynamics of extremely complex/diverse microbial communities. For this analysis on the Illumina platform, merging and quality filtering of paired-end reads are essential first steps in data analysis to ensure the accuracy and reliability of downstream analysis. We have developed the Merging and Filtering Tool (MeFiT) to combine these pre-processing steps into one simple, intuitive pipeline. MeFiT provides an open-source solution that permits users to merge and filter paired end illumina reads based on user-selected quality parameters. The tool has been implemented in python and the source-code is freely available at https://github.com/nisheth/MeFiT.

Hardik I. Parikh

What is connected

Connect this record

See the researcher in context

Building this map preview

3 published item(s)

Massive Multi-Omics Microbiome Database (M3DB): A Scalable Data Warehouse and Analytics Platform for Microbiome Datasets

MEEPTOOLS: A maximum expected error based FASTQ read filtering and trimming toolkit

MeFiT: Merging and Filtering Tool for Illumina Paired-End Reads for 16S rRNA Amplicon Sequencing