Command Line Tool API

This page contains a description of all the subcommand inputs of the command line tool velocyto. For a usage guide refer to the command line tool tutorial

velocyto

velocyto [OPTIONS] COMMAND [ARGS]...

Options

--version

Show the version and exit.

run

Runs the velocity analysis outputting a loom file

BAMFILE bam file with sorted reads

GTFFILE genome annotation file

velocyto run [OPTIONS] BAMFILE... GTFFILE

Options

-b, --bcfile <bcfile>

Valid barcodes file, to filter the bam. If –bcfile is not specified all the cell barcodes will be included. Cell barcodes should be specified in the bcfile as the CB tag for each read

-o, --outputfolder <outputfolder>

Output folder, if it does not exist it will be created.

-e, --sampleid <sampleid>

The sample name that will be used to retrieve informations from metadatatable

-s, --metadatatable <metadatatable>

Table containing metadata of the various samples (csv formatted, rows are samples and cols are entries)

-m, --mask <mask>

.gtf file containing intervals to mask

-c, --onefilepercell

If this flag is used every bamfile passed is interpreted as an independent cell, otherwise multiple files are interpreted as batch of different cells to be analyzed together. Important: cells reads should not be distributed over multiple bamfiles is not supported!! (default: off)

-l, --logic <logic>

The logic to use for the filtering (default: Default)

-U, --without-umi

If this flag is used the data is assumed UMI-less and reads are counted instead of molecules (default: off)

-u, --umi-extension <umi_extension>

In case UMI is too short to guarantee uniqueness (without information from the ampping) set this parameter to chr, Gene ro [N]bp If set to chr the mapping position (binned to 10Gb intervals) will be appended to UB (ideal for InDrops+dropEst). If set to Gene then the GX tag will be appended to the UB tag. If set to [N]bp the first N bases of the sequence will be used to extend UB (ideal for STRT). (Default: no)

-M, --multimap

Consider not unique mappings (not reccomended)

-@, --samtools-threads <samtools_threads>

The number of threads to use to sort the bam by cellID file using samtools

--samtools-memory <samtools_memory>

The number of MB used for every thread by samtools to sort the bam file

-t, --dtype <dtype>

The dtype of the loom file layers - if more than 6000 molecules/reads per gene per cell are expected set uint32 to avoid truncation (default run: uint32)

-d, --dump <dump>

For debugging purposes only: it will dump a molecular mapping report to hdf5. –dump N, saves a cell every N cells. If p is prepended a more complete (but huge) pickle report is printed (default: 0)

-v, --verbose

Set the vebosity level: -v (only warnings) -vv (warnings and info) -vvv (warnings, info and debug)

Arguments

BAMFILE

Required argument(s)

GTFFILE

Required argument

run-dropest

Runs the velocity analysis on DropEst preprocessed data

BAMFILE bam files to be analyzed

GTFFILE genome annotation file

velocyto run-dropest [OPTIONS] BAMFILE GTFFILE

Options

-b, --bcfile <bcfile>

Valid barcodes file, to filter the bam. If –bcfile is not specified the file will be searched in the default position outputted by velocyto tools dropest_bc_correct. Otherwise an error will be thrown

-l, --logic <logic>

The logic to use for the filtering (default: Default)

-o, --outputfolder <outputfolder>

Output folder, if it does not exist it will be created.

-e, --sampleid <sampleid>

The sample name that will be used as a the filename of the output.

-m, --repmask <repmask>

.gtf file containing intervals to mask (Optional)

-@, --samtools-threads <samtools_threads>

The number of threads to use to sort the bam by cellID file using samtools

--samtools-memory <samtools_memory>

The number of MB used for every thread by samtools to sort the bam file

-t, --dtype <dtype>

The dtype of the loom file layers - if more than 6000 molecules/reads per gene per cell are expected set uint32 to avoid truncation (default run_dropest: uint32)

-d, --dump <dump>

For debugging purposes only: it will dump a molecular mapping report to hdf5. –dump N, saves a cell every N cells. If p is prepended a more complete (but huge) pickle report is printed (default: 0)

-v, --verbose

Set the vebosity level: -v (only warnings) -vv (warnings and info) -vvv (warnings, info and debug)

Arguments

BAMFILE

Required argument

GTFFILE

Required argument

run-smartseq2

Runs the velocity analysis on SmartSeq2 data (independent bam file per cell)

[BAMFILES, …] a sequence of bam files to be analyzed (e.g. use a wild-card expansion)

GTFFILE genome annotation file

velocyto run-smartseq2 [OPTIONS] BAMFILES... GTFFILE

Options

-o, --outputfolder <outputfolder>

Output folder, if it does not exist it will be created.

-e, --sampleid <sampleid>

The sample name that will be used as a the filename of the output.

-m, --repmask <repmask>

.gtf file containing intervals to mask

-t, --dtype <dtype>

The dtype of the loom file layers - if more than 6000 molecules/reads per gene per cell are expected set uint32 to avoid truncation (default in run_smartseq2: uint32)

-d, --dump <dump>

For debugging purposes only: it will dump a molecular mapping report to hdf5. –dump N, saves a cell every N cells. If p is prepended a more complete (but huge) pickle report is printed (default: 0)

-v, --verbose

Set the verbosity level: -v (only warnings) -vv (warnings and info) -vvv (warnings, info and debug)

Arguments

BAMFILES

Required argument(s)

GTFFILE

Required argument

run10x

Runs the velocity analysis for a Chromium 10X Sample

10XSAMPLEFOLDER specifies the cellranger sample folder

GTFFILE genome annotation file

velocyto run10x [OPTIONS] SAMPLEFOLDER GTFFILE

Options

-s, --metadatatable <metadatatable>

Table containing metadata of the various samples (csv fortmated rows are samples and cols are entries)

-m, --mask <mask>

.gtf file containing intervals to mask

-l, --logic <logic>

The logic to use for the filtering (default: Default)

-M, --multimap

Consider not unique mappings (not reccomended)

-@, --samtools-threads <samtools_threads>

The number of threads to use to sort the bam by cellID file using samtools

--samtools-memory <samtools_memory>

The number of MB used for every thread by samtools to sort the bam file

-t, --dtype <dtype>

The dtype of the loom file layers - if more than 6000 molecules/reads per gene per cell are expected set uint32 to avoid truncation (default run_10x: uint16)

-d, --dump <dump>

For debugging purposes only: it will dump a molecular mapping report to hdf5. –dump N, saves a cell every N cells. If p is prepended a more complete (but huge) pickle report is printed (default: 0)

-v, --verbose

Set the vebosity level: -v (only warinings) -vv (warinings and info) -vvv (warinings, info and debug)

Arguments

SAMPLEFOLDER

Required argument

GTFFILE

Required argument

tools

helper tools for velocyto

velocyto tools [OPTIONS] COMMAND [ARGS]...

dropest-bc-correct

Using the output of DropEst: (1) Corrects barcodes directly in the bam file (2) Produces a valid barcodes list

BAMFILEPATH - bam file with sorted reads obtained running DropEst

DROPEST-OUT - R dump rds file generated by DropEst

velocyto tools dropest-bc-correct [OPTIONS] BAMFILEPATH DROPEST_OUT

Options

-o, --corrected-output <corrected_output>

(Optional) The file output of the output bam file. Otherwise the file will be outputted in the same folder of the input with the prefix correct_

Arguments

BAMFILEPATH

Required argument

DROPEST_OUT

Required argument