Author: David Anderson Date: To: hampshire Subject: Re: [Hampshire] extracting phrases from a file.
On Mon, 12 Sep 2011 10:17:44 +0100
James Courtier-Dutton <james.dutton@???> wrote:
> Hi.
>
> I have a large file that contains snips of http pages.
> Each line is like this:
> ....some junk.....<a href="some url"></a>
>
> I want extract the "some url" bits. I.e. Remove the href.
> You can probably do this quite easily in perl.
> Are there any nice short programs to do this?
> Is it easier to do in some other language?
Python and BeautifulSoup would make this very easy