TimeForScience Repo on GitHub

The software below is freely available on my GitHub repository.
- For details, either check the readme on GitHub…
- …or check a description of some of the tools below.
There are no specific warranties of suitability or quality! You might find bugs.

Summary

This is a list of free and open source command line tools.
You can download these programs here on GitHub.
- ZIP file of all 250+ programs: ZIP download (~2 MB)
Perl programs should work with any version of Perl 5 since 2010. Python programs require Python 2.7+.
All these programs run on the command line. None of them use X11.
Most programs also have their own documentation via the “–help” or “-h” option.

A blue snake

Spreadsheet Viewers

For viewing tabular data (plain text format only) directly on the command line. Generally expects tab-delimited input files.

sheet.py

sheet.py is an interactive terminal-based spreadsheet viewer for tab-delimited files. Sort of like a bare-bones read-only version of Excel.

sheet.pl

sheet.pl pre-processes input data into reasonable column-delimited tabular files that you can then pipe into less -S. Similar to unix ‘column’ command.

A blue cat face

Recommended Tools

trash.pl

trash.pl is a “safer rm” that moves files to a new temporary directory in /tmp/ instead of immediately removing them. May fill up your /tmp partition if you delete extremely large files, so beware.

ditto_mark.pl

ditto_mark.pl marks duplicated cells in a tab-delimited file (just like ditto marks in an old ledger). Good for finding duplicates in a visually obvious fashion.

cut.pl

This is a version of “cut” that allows you to output the results in an arbitrary order. For example, cut.pl -f 2,1,3- would switch columns 2 and 1, and leave columns 3 and beyond in the same order.

join.pl

A modified version of UNIX join. It can handle un-sorted input and deal with case-insensitive joins. Can also accept multiple input files all at once.

sort.pl

Can sort compressed (gzip/bzip2) files and can accept header line(s). It uses the fast UNIX sort internally. Frequency-of-use rating: 9/10.

mdverify.pl

mdverify.pl is a script to easily verify a bunch of files with md5 checksums. It runs on both Mac and Linux and can handle several types of input md5 file, unlike normal md5sum.

A blue octopus

Bioinformatics

SAM/BAM → UCSC Browser (.pl)

convert_SAM_or_BAM_for_Genome_Browser.pl converts input BAM/SAM files into tracks for the UC Santa Cruz Genome Browser (UCSC Genome Browser), and provides a track description file.

fasta2gtf.pl

fasta2gtf.pl (a bioinformatics-specific tool) takes a FASTA file and makes a GTF file that spans each chromosome.

A blue snake

Scientific / Data Processing

qplz.pl (qplease.pl)

qplz.pl (“queue please”) can submit jobs to a PBS Pro queue in user-friendly fashion. Tested with PBS Pro version 13 (August 2016). May also work with TORQUE.

rand_lines.pl

Randomly chooses a certain number of lines from a file. Can sample with or without replacement. It can also pull out multi-line records (for example, in a FASTQ file, each record is actually 4 rows). Becomes very slow if files have > 1 million lines.

matrix_from_edge_list.pl

matrix_from_edge_list.pl can turn a 2- or 3-column file into a matrix. The matrix will either be an adjacency matrix (2 column input) or will have the values of each edge (3 column input).

select_best_item.pl

select_best_item.pl picks the best N items (rows) with a given key (in a user-specified column).

Other

Other programs

There are a ton of additional programs on the TimeForScience GitHub repository, some of which have even been properly documented.

A dangerous snake with a sword

Programs that aren’t on GitHub

hue.pl (Philips Hue lights)

Command line program for controlling the Philips Hue colored light system.
The Hue has very minimal software support (almost nonexistent on the desktop and not especially convenient on phones), and a proper light switch costs $60.
But if you have one and want to (say) set up a cron job for turning your lights on at 6 PM, you can use this script for that.
Download the hue.pl command line controller script for the Philips Hue:
- Download link to hue.pl (.gz) (5 KB)
- Browse the hue.pl code as plain text

Free Software

Contents