[Sussex] Converting OpenOffice documents to XHTML
Steve Dobson
steve at dobson.org
Wed Nov 10 17:12:27 UTC 2004
Geoff
Thanks for the advice. I was unsure and your advice is very welcome.
Can you point me at a good howto to get me started on doing this?
Steve
On Wed, Nov 10, 2004 at 05:02:51PM +0000, Geoff Teale wrote:
> Steve,
>
> You might be better off writing some code in OO.org itself to do this -
> iterating over the sections is going to be less hairy than trying to do
> XSLT or some other XML level transformation.
>
>
> On Wed, 2004-11-10 at 16:53 +0000, Steve Dobson wrote:
> > Geoff
> >
> > On Wed, Nov 10, 2004 at 04:34:50PM +0000, Geoff Teale wrote:
> > > Steve,
> > >
> > > More detail please .. I'm now very experienced in coding in
> > > OpenOffice.org but I'm not quite clear about what you are trying to do
> > > here.
> >
> > They currently have paper based process backed up by a manual. They are
> > currently expanding the manual and with lots of new stuff but we have
> > it on the idea of converting this to a "web application".
> >
> > The manual is going to be converted into a set of XHTML pages.
> > Rather than have them cut & past the text into the database in a slow,
> > mandrollic process I would much prefer to extract on the structure
> > of the document (now in OpenOffice) and load that data into the database.
> >
> > For example where the document reads:
> >
> > 1.1 Heath and Safety
> >
> > 1.1.2. First Aid Kits
> >
> > The first aid kids need to be checked weekly to ensure each is
> > stocked with the appropriate stuff.
> >
> >
> > I would like to turn that into something like:
> >
> > <section>
> > <title>Heath and Safety</title>
> > <subsection>
> > <title>First Aid Kits</title>
> > <text>
> > The first aid kids need to be checked weekly to ensure each is
> > stocked with the appropriate stuff.
> > </text>
> > </subsection>
> > </section>
> >
> > That way I could then parse this new XML using PHP extract the various
> > bits and insert them into the appropriate rows and columns in the database.
> >
> > I haven't yet designed the database so I am flexible on who this is best
> > to be done. I'm looking here for the best way of doing this. I've seen
> > stuff on XML style conversion, but is this the way to go?
> >
> > Steve
> >
> >
> > _______________________________________________
> > Sussex mailing list
> > Sussex at mailman.lug.org.uk
> > http://mailman.lug.org.uk/mailman/listinfo/sussex
> >
> > _____________________________________________________________________
> > This e-mail has been scanned for viruses by MCI's Internet Managed Scanning Services - powered by MessageLabs. For further information visit http://www.mci.com
> >
> --
> Geoff Teale <gteale at cmedltd.com>
> Cmed Technology
>
>
> _______________________________________________
> Sussex mailing list
> Sussex at mailman.lug.org.uk
> http://mailman.lug.org.uk/mailman/listinfo/sussex
More information about the Sussex
mailing list