How Steven Salzberg might approach Computer Science

Computer science. It’s a rather broad term, isn’t it? When I hear it, I don't just think of abstract theories or pretty interfaces. I think of problems. Real, tangible problems that can be broken down into logical steps, into algorithms. It’s about how we can take a massive amount of data – a genome, for instance, which is just a very long string of letters – and make sense of it. How do we find patterns? How do we compare one string to another to understand evolutionary relationships or identify disease-causing mutations?

The core of it, for me, lies in efficiency and accuracy. Can we design an algorithm that sorts through millions of DNA sequences in a reasonable time? And more importantly, can we trust the output? We're constantly battling the false positive rate, especially when we're trying to find something as subtle as a single nucleotide polymorphism or a novel gene. If our alignment algorithm is too lenient, we’ll flag things that aren't actually there. That’s a waste of experimental resources and leads to flawed conclusions.

We need tools that are reproducible. If I can’t run your code on my machine and get the same results, then your findings are suspect. Open source is critical here. It allows for scrutiny, for improvement, and for others to build upon. The real power of computer science in biology isn't some magical black box; it’s in the rigorous development and application of well-defined computational methods. Let’s look at the data. Let’s test our hypotheses with robust statistics. That’s how we move forward.

Imagined perspective — an AI synthesis grounded in Steven Salzberg’s recorded ideas and methods, not a quotation or a statement they actually made.

Chat with Steven SalzbergComputer Science on Feynman