Parse some HTML with PHP and output to text

Hi,

As the subject says, I want to parse some HTML and write the contents of certain classes to a .txt file.
I only want the text from the headword classes and the inflection classes that follow them, nothing else from the document.

Example of the HTML:

[php]autentifisere
-te;
-ing gi (et dokument) en form som garanterer ekthet


Etym.: jf.
▶
autentisere
415863 23 7
gå god for en tings ekthet
autentisereautentisere autentiserea autentisere º
autentisere -te; -ing[/php]

The output I want to end up with is:
"
[php]autentifisere
autentifiserte
autentisere
autentiserte
autentifisering[/php]
"

As you might have guessed, this is a wordlist in HTML that I want to turn into a text file, and eventually into an SQL database. I've got the basics of PHP down (some SQL, file writing, etc.), but I felt kind of lost trying to parse this text.
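
To make the goal concrete, here is a rough sketch of the kind of thing I imagine, using PHP's built-in DOMDocument and DOMXPath. It is not a tested solution for the real file. The class names "headword" and "inflection" are assumptions (the real markup may name them differently), and so is the rule for turning a suffix like "-te" into a full word: drop the final "e" of the headword and append the suffix. The function names are made up for illustration.

```php
<?php
// Sketch only: the class names "headword" and "inflection" are assumed,
// as is the suffix-expansion rule below.

/**
 * Expand a suffix such as "-te" against a headword such as "autentisere".
 * Assumed rule: drop a final "e" from the headword, then append the suffix.
 * Anything that does not start with "-" is treated as a full word already.
 */
function expandInflection(string $headword, string $suffix): string
{
    if (substr($suffix, 0, 1) !== '-') {
        return $suffix;
    }
    $stem = preg_replace('/e$/u', '', $headword);
    return $stem . substr($suffix, 1);
}

/**
 * Return headwords and their expanded inflections, in document order.
 */
function extractWords(string $html): array
{
    $doc = new DOMDocument();
    // Collect parser warnings internally instead of printing them.
    libxml_use_internal_errors(true);
    // The prefix hints UTF-8 so characters like "å" survive loading.
    $doc->loadHTML('<?xml encoding="UTF-8">' . $html);
    libxml_clear_errors();

    // Match any element whose class list contains one of the two names.
    $xpath = new DOMXPath($doc);
    $query = '//*[contains(concat(" ", normalize-space(@class), " "), " headword ")'
           . ' or contains(concat(" ", normalize-space(@class), " "), " inflection ")]';

    $words = [];
    $current = null; // most recent headword seen
    foreach ($xpath->query($query) as $node) {
        $classes = preg_split('/\s+/', trim($node->getAttribute('class')));
        $text = trim($node->textContent);
        if (in_array('headword', $classes, true)) {
            $current = $text;
            $words[] = $current;
        } elseif ($current !== null) {
            // One inflection element may hold several suffixes: "-te; -ing".
            foreach (preg_split('/[;,\s]+/u', $text, -1, PREG_SPLIT_NO_EMPTY) as $part) {
                $words[] = expandInflection($current, $part);
            }
        }
    }
    return $words;
}
```

Writing the result to a file would then be something like file_put_contents('wordlist.txt', implode("\n", extractWords($html)) . "\n"), where $html holds the page contents and wordlist.txt is just a placeholder name.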

Hoping for some hints! Thanks!

-Fredrik
