Karsten Loesing, 29.05.2012 19:43:
I did just that.
https://trac.torproject.org/projects/tor/wiki/doc/DataExtractionForCompariso...
Thanks for creating that page. Looks line a fine start, though you'll want to automate more things when looking at 2012 tarballs.
Well without grep I'd be still copying out nicknames.
grep and friends are fine tools to process Tor descriptors. If you can, find a Unix/Linux-like environment for Windows (Cygwin?) and combine the powers of grep with sort, uniq, and maybe sed or awk. These tools are friggin' fast!
Cygwin might not be the right solution. I would have to compile the tools from source. Lucky me, those tools are available for Windows. Thanks to the people providing the binaries and the docs.
I have to look if they are equal to the tools you have mentioned. awk is named gawk. All of them are command-line tools and I have to learn how to use them.
And I need to figure out how to strip the "r" or copy only the nickname.
If you're comfortable with Java and want to do more fancy stuff with Tor descriptors, take a look at metrics-lib:
https://gitweb.torproject.org/metrics-lib.git
If you're a Python person, you'll like stem, even though it only implements parsing of a subset of Tor descriptors. More to come soon:
Thanks to anyone for coding on them or otherwise maintaining them.
I can't compile stuff, read or write code. That includes scripting. There's a reason why I'm on Windows.
Regards, Sebastian