[Sussex] Converting OpenOffice documents to XHTML

Steve Dobson steve at dobson.org
Wed Nov 10 17:12:27 UTC 2004


Geoff

Thanks for the advice.  I was unsure and your advice is very welcome.

Can you point me at a good howto to get me started on doing this?

Steve

On Wed, Nov 10, 2004 at 05:02:51PM +0000, Geoff Teale wrote:
> Steve,
> 
> You might be better off writing some code in OO.org itself to do this -
> iterating over the sections is going to be less hairy than trying to do
> XSLT or some other XML level transformation.
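
A rough sketch of the iterate-over-the-sections idea, written in plain Python rather than against the OO.org scripting API. It assumes the paragraphs have already been collected as (style name, text) pairs; the style names "Heading 1", "Heading 2" and "Text body", the <manual> root element, and the function name build_manual are all assumptions for illustration, not something taken from the thread.

```python
import xml.etree.ElementTree as ET

# Hypothetical input: (paragraph style, text) pairs, in document order,
# as a macro or script walking the document's paragraphs might collect them.
PARAGRAPHS = [
    ("Heading 1", "Health and Safety"),
    ("Heading 2", "First Aid Kits"),
    ("Text body", "The first aid kits need to be checked weekly."),
]


def build_manual(paragraphs):
    """Nest flat, styled paragraphs into section/subsection elements."""
    root = ET.Element("manual")  # assumed wrapper element
    section = None    # most recent top-level section
    container = None  # element that receives body text
    for style, text in paragraphs:
        if style == "Heading 1":
            section = ET.SubElement(root, "section")
            ET.SubElement(section, "title").text = text
            container = section
        elif style == "Heading 2":
            if section is None:
                # A subsection with no parent heading gets an untitled section.
                section = ET.SubElement(root, "section")
            sub = ET.SubElement(section, "subsection")
            ET.SubElement(sub, "title").text = text
            container = sub
        elif container is not None:
            # Body text attaches to the nearest preceding heading;
            # text before the first heading is skipped in this sketch.
            ET.SubElement(container, "text").text = text
    return root


if __name__ == "__main__":
    print(ET.tostring(build_manual(PARAGRAPHS), encoding="unicode"))
```

The only state the loop needs is the current section and the element that body text should attach to, which is why walking the paragraphs in order can be simpler than a stylesheet-level transformation of flat heading markup.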
> 
> 
> On Wed, 2004-11-10 at 16:53 +0000, Steve Dobson wrote:
> > Geoff
> > 
> > On Wed, Nov 10, 2004 at 04:34:50PM +0000, Geoff Teale wrote:
> > > Steve,
> > > 
> > > More detail please .. I'm now very experienced in coding in
> > > OpenOffice.org but I'm not quite clear about what you are trying to do
> > > here.
> > 
> > They currently have a paper-based process backed up by a manual.  They are
> > currently expanding the manual with lots of new stuff, but we have
> > hit on the idea of converting this to a "web application".
> > 
> > The manual is going to be converted into a set of XHTML pages.
> > Rather than have them cut & paste the text into the database in a slow,
> > mandraulic process, I would much prefer to extract the structure
> > of the document (now in OpenOffice) and load that data into the database.
> > 
> > For example where the document reads:
> > 
> >   1.1   Health and Safety
> > 
> >   1.1.2.  First Aid Kits
> > 
> >         The first aid kits need to be checked weekly to ensure each is
> >         stocked with the appropriate stuff.
> > 
> > 
> > I would like to turn that into something like:
> > 
> >   <section>
> >      <title>Health and Safety</title>
> >      <subsection>
> >         <title>First Aid Kits</title>
> >         <text>
> >             The first aid kits need to be checked weekly to ensure each is
> >             stocked with the appropriate stuff.
> >         </text>
> >      </subsection>
> >   </section>
> > 
> > That way I could then parse this new XML using PHP, extract the various
> > bits, and insert them into the appropriate rows and columns in the database.
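
A minimal sketch of that parse-and-load step. The message proposes PHP; Python with an in-memory SQLite database is swapped in here purely for illustration. The table names, column names, and the load_section function are hypothetical, since the database has not been designed yet.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Sample input in the shape proposed above.
SAMPLE = """<section>
  <title>Health and Safety</title>
  <subsection>
    <title>First Aid Kits</title>
    <text>
      The first aid kits need to be checked weekly to ensure each is
      stocked with the appropriate stuff.
    </text>
  </subsection>
</section>"""


def load_section(conn, xml_text):
    """Insert one <section> and its <subsection> children; return the section id."""
    section = ET.fromstring(xml_text)
    # Hypothetical schema: one row per section, one row per subsection.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sections "
        "(id INTEGER PRIMARY KEY, title TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS subsections "
        "(id INTEGER PRIMARY KEY, section_id INTEGER REFERENCES sections(id), "
        "title TEXT, body TEXT)"
    )
    cur = conn.execute(
        "INSERT INTO sections (title) VALUES (?)",
        (section.findtext("title"),),
    )
    section_id = cur.lastrowid
    for sub in section.findall("subsection"):
        # Collapse the indentation and line breaks inside <text>.
        body = " ".join((sub.findtext("text") or "").split())
        conn.execute(
            "INSERT INTO subsections (section_id, title, body) VALUES (?, ?, ?)",
            (section_id, sub.findtext("title"), body),
        )
    conn.commit()
    return section_id


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    sid = load_section(conn, SAMPLE)
    for row in conn.execute(
        "SELECT title, body FROM subsections WHERE section_id = ?", (sid,)
    ):
        print(row)
```

Parameterised queries keep quotes and other punctuation in the manual text from breaking the inserts; the same approach carries over to PHP's own XML and database libraries.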
> > 
> > I haven't yet designed the database so I am flexible on how this is best
> > done.  I'm looking here for the best way of doing this.  I've seen
> > stuff on XML style conversion, but is this the way to go?
> > 
> > Steve 
> > 
> > 
> > _______________________________________________
> > Sussex mailing list
> > Sussex at mailman.lug.org.uk
> > http://mailman.lug.org.uk/mailman/listinfo/sussex
> > 
> -- 
> Geoff Teale <gteale at cmedltd.com>
> Cmed Technology