Settings

Theme

Handbook of Text manipulation on Unix

ibm.com

316 points by AbyCodes 14 years ago · 24 comments

Reader

Nate75Sanders 14 years ago

Related to this, pyp is worth taking a look at if you're interested in doing manipulation using python's libraries, but on the command line:

http://code.google.com/p/pyp/

luriel 14 years ago

The Unix Programming Environment by Kernighan and Pike and The AWK Programming Language are still the best books one can read about Unix text manipulation, and about Unix, period. (Part of the point is that in Unix text is supposed to be the universal language).

aghull 14 years ago

I like how it's laid out from the most specific tools that are easy to understand and eventually leads to the pocketknives of sed and awk that beginners might not need until they've exhausted the potential of the previous commands.

bradly 14 years ago

Unix for Poets is a great set of exercises for someone wanting to learn more about text manipulation with Unix tools.

http://www.iro.umontreal.ca/~felipe/IFT6010-Automne2011/reso...

skrubly 14 years ago

Thanks for this! I really like these kinds of summaries, because while I love grep and cut and wc and perl, there are commands in here I really haven't heard of.

Plus I enjoy stringing together one-off filters longer than my arm.

webnrrd2k 14 years ago

If you like this, then check out Unix Power Tools. It's full of exactly this kind of stuff, with broader and deeper coverage. I highly recommend it -- I consider it one of the top ten or so books for a new programmer to spend some time with.

Nate75Sanders 14 years ago

One useful addition to the section on streams would have been that of process substitution:

http://tldp.org/LDP/abs/html/process-sub.html

This allows you to have more than just the standard streams.

pkrumins 14 years ago

Also take a look at my 3 e-books on awk, sed and perl: http://www.catonmat.net/books/

microtonal 14 years ago

I once wrote this introduction to UNIX (which is unfortunately not complete, I lost the DocBook sources), that also provides an introduction to text manipulation.

http://danieldk.eu/Writings/unixsystems.pdf

pseudoswamy 14 years ago

Good post. How can I tell if a tool supports UTF-8 (or some other encoding) or not?

gasda 14 years ago

join was new to me. I like it....

Always happy to learn a new command.

agentgt 14 years ago

This used to be a great site (ignore its very un-PC site name):

http://bashcurescancer.com/

It seems the site is down.

indubitably 14 years ago

Thanks for this, had never heard of csplit. Too bad the OSX version sucks.

gurraman 14 years ago

Sort of related: rpl[1] is an often overlooked tool for replacing text across multiple files. Terser than "perl pie" and a few nice features like simulation mode.

[1] http://www.laffeycomputer.com/rpl.html

zwischenzug 14 years ago

So sad that the writer lets himself down in the first line.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection