Source author record

Nihar U. Sheth

Nihar U. Sheth appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Genomics; Other Quantitative Biology; Quantitative Methods

Catalog footprint

What is connected

5 works

3 topics

4 close collaborators

Preprint, 2015, arXiv

Massive Multi-Omics Microbiome Database (M3DB): A Scalable Data Warehouse and Analytics Platform for Microbiome Datasets

Massive Multi-Omics Microbiome Database (M3DB) is a data warehousing and analytics solution designed to handle the diverse, complex, and unprecedented volumes of sequence and taxonomic classification data obtained in a typical microbiome project using NGS technologies. M3DB is built on Apache Hadoop, Apache Hive and PostgreSQL. It lets users store, manage and analyze high volumes of data and query them quickly and efficiently. The M3DB framework includes command-line tools to process and store microbiome data, along with an easy-to-use web interface for uploading, querying, analyzing and visualizing the data and results. Availability: The source code of M3DB is freely available for download at http://www.github.com/nisheth/M3DB.

Preprint, 2015, arXiv

MEEPTOOLS: A maximum expected error based FASTQ read filtering and trimming toolkit

Next-generation sequencing technology rapidly produces massive volumes of data, and quality control of these sequencing data is essential to any genomic analysis. Here we present MEEPTOOLS, a collection of open-source tools based on maximum expected error as a percentage of read length (MEEP score) to filter, trim, truncate and assess next-generation DNA sequencing data in FASTQ format. MEEPTOOLS provides a non-traditional approach to read filtering and trimming based on the maximum error probabilities of the bases in the read on a non-logarithmic scale. This method simultaneously retains more reliable bases and removes more unreliable bases than traditional quality filtering strategies.
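The idea of an expected error expressed as a percentage of read length can be illustrated with a minimal Python sketch. This is not the MEEPTOOLS implementation: the function names, the Phred+33 encoding, and the example threshold of 1.0 are assumptions made for illustration only.

```python
def meep_score(quality: str, phred_offset: int = 33) -> float:
    """Expected number of errors in a read, as a percentage of its length.

    Assumes a FASTQ quality string in Phred+33 encoding (an assumption;
    older Illumina files may use an offset of 64).
    """
    if not quality:
        raise ValueError("quality string must not be empty")
    # Convert each quality character to an error probability,
    # i.e. move from the logarithmic Phred scale to a linear one.
    error_probs = [10 ** (-(ord(ch) - phred_offset) / 10) for ch in quality]
    # Sum of per-base error probabilities = expected number of errors.
    return 100.0 * sum(error_probs) / len(quality)


def passes_filter(quality: str, max_meep: float = 1.0) -> bool:
    """Keep a read if its score does not exceed a hypothetical cutoff."""
    return meep_score(quality) <= max_meep
```

For a read whose bases are all Q40 (character `I` in Phred+33), each base has an error probability of 0.0001, so the score is 0.01 expected errors per 100 bases regardless of read length.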

Preprint, 2015, arXiv

MeFiT: Merging and Filtering Tool for Illumina Paired-End Reads for 16S rRNA Amplicon Sequencing

Recent advances in next-generation sequencing have revolutionized genomic research. 16S rRNA amplicon sequencing using paired-end sequencing on the MiSeq platform from Illumina, Inc., is being used to characterize the composition and dynamics of extremely complex and diverse microbial communities. On the Illumina platform, merging and quality filtering of paired-end reads are essential first steps to ensure the accuracy and reliability of downstream analysis. We have developed the Merging and Filtering Tool (MeFiT) to combine these pre-processing steps into one simple, intuitive pipeline. MeFiT provides an open-source solution that permits users to merge and filter paired-end Illumina reads based on user-selected quality parameters. The tool is implemented in Python and the source code is freely available at https://github.com/nisheth/MeFiT.

Preprint, 2014, arXiv

In Silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem Cell Transplant Donors and Recipients: Understanding the Quantitative Immuno-biology of Allogeneic Transplantation

Donor T cell-mediated graft vs. host effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the HLA in each donor-recipient pair (DRP) undergoing stem cell transplantation (SCT). Whole exome sequencing has demonstrated extensive nucleotide sequence variation in HLA-matched DRP. Non-synonymous single nucleotide polymorphisms (nsSNPs) in the GVH direction (polymorphisms present in recipient and absent in donor) were identified in 4 HLA-matched related and 5 unrelated DRP. The nucleotide sequence flanking each SNP was obtained using the ANNOVAR software package. All possible nonameric peptides encoded by the non-synonymous SNP were then interrogated in silico for their likelihood to be presented by the HLA class I molecules in individual DRP, using the Immune Epitope Database (IEDB) SMM algorithm. The IEDB-SMM algorithm predicted a median of 18,396 peptides/DRP that bound HLA with an IC50 of <500 nM, and 2,254 peptides/DRP with an IC50 of <50 nM. Unrelated donors generally had higher numbers of peptides presented by the HLA. A similarly large library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. These peptides were uniformly distributed in the various organ systems. The bioinformatic algorithm presented here demonstrates that there may be a high level of minor histocompatibility antigen variation in HLA-matched individuals, constituting an HLA-specific alloreactivity potential. These data provide a possible explanation for how relatively minor adjustments in GVHD prophylaxis yield relatively similar outcomes in HLA-matched and mismatched SCT recipients.
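The step of enumerating "all possible nonameric peptides" around a variant can be sketched as a sliding window over the protein sequence. This is an illustrative Python sketch, not the authors' pipeline, which relied on ANNOVAR and the IEDB tools; the function name, the 0-based position convention, and the example sequence are hypothetical.

```python
def peptides_spanning(protein: str, pos: int, k: int = 9) -> list[str]:
    """Return every k-mer of `protein` that contains the residue at `pos`.

    `pos` is a 0-based index of the variant residue (an assumed convention).
    Windows that would run past either end of the sequence are skipped.
    """
    if not 0 <= pos < len(protein):
        raise ValueError("pos is outside the protein sequence")
    first_start = max(0, pos - k + 1)      # leftmost window still covering pos
    last_start = min(pos, len(protein) - k)  # rightmost window that still fits
    return [protein[s:s + k] for s in range(first_start, last_start + 1)]


# Hypothetical 20-residue sequence with the variant residue at index 10.
example = peptides_spanning("ACDEFGHIKLMNPQRSTVWY", 10)
```

A variant far enough from both ends of the sequence yields nine overlapping 9-mers, each of which would then be scored separately by a binding predictor; variants near a terminus yield fewer.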

Preprint, 2014, arXiv

Whole Exome Sequencing to Estimate Alloreactivity Potential Between Donors and Recipients in Stem Cell Transplantation

Whole exome sequencing was performed on HLA-matched stem cell donors and transplant recipients to measure sequence variation contributing to minor histocompatibility antigen differences between the two. A large number of nonsynonymous single nucleotide polymorphisms were identified in each of the nine unique donor-recipient pairs tested. This variation was greater in magnitude in unrelated donors as compared with matched related donors. Knowledge of the magnitude of exome variation between stem cell transplant recipients and donors may allow more accurate titration of immunosuppressive therapy following stem cell transplantation.
