Linux Users' Group of Davis
Page last updated:
2008 Jan 19 18:59

The following is an archive of a post made to our 'vox-tech mailing list' by one of its subscribers.

Re: [vox-tech] Finding the right tool for parsing

On Sat, Jan 19, 2008 at 01:50:08PM -0800, Alex Mandel wrote:
> I've got a big text file to parse (example below)

Stick it on a website somewhere... it's hard to write a parser when inline
email handling might be misformatting things.

> The only pattern I can find to parse on is a:

Looks like there are lots of patterns to parse on: that "." on a line by
itself, the address line with two commas in it, the " mi " line, etc.

Seems like a simple job in perl to match any of those.

See if setting perl's input record separator, $/ = "\n.\n", might work.

ie: given the file named foo

and perl:
perl -ne 'BEGIN{$/="\n.\n"} print "+++$_---"' foo

you get:

This demonstrates that perl is treating everything (including newlines)
between the "."-on-a-line-by-itself markers as a unit (in the $_ variable).

Ted Deppner