was such a bad idea. What could
Stan> this man to a browser that doesn't have this extension? On the
Stan> other hand using \ at least says to treat the text
Stan> as a paragraph even if centering is not supported.
This works because 'ALIGN' has no semantic connotation. '\
' is a
paragraph however it is aligned. But consider instead
\
\
\
If this usage spreads (which in some measure is inevitable), then HTML
could be replaced by HTML(-1), with a single
\
construct. :-)
Newsgroups: comp.text.sgml
From: kaikow@standards.com
Subject: Re: SGML book?
Originator: kaikow@mv.mv.com
Message-ID: \
Sender: kaikow@standards.com
Reply-To: kaikow@standards.com
Organization: MV Communications, Inc.
Date: Fri, 24 Nov 1995 05:33:46 GMT
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ <1995Nov20.045420.29244@sq.com> <1995Nov20.214942.10037@sq.com>
Followup-To: kaikow@standards.com
I agree, but a few bookstores stock SGML books. Others will get copies of
most books for one to look at with no obligation to buy, however, most
will only do that if the book is available thru their supplier. Alas,
Practical SGML is not so available.
I don't like buying any book without first seeing it, but SGML books are
just not on store shelves everywhere.
Now, if you want HTML, there are many to choose from!
In article <1995Nov20.214942.10037@sq.com>, Liam R. E. Quin \ wrote:
>A couple of people have asked, so I just wanted to point out that the
>comments I made about various SGML books are my initial opinions based
>on a quick look (as I said in the article, but people asked anyway!).
>
>Please, if you're going to buy books, look at them, and read reviews!
>We're very fortunate that there are quite a few books on SGML, and more
>seem to be on the way, so you can choose ones that will suit you.
>
>Lee
>
>--
>Liam Quin, SoftQuad Inc +1 416 239 4801 lee@sq.com \
>HexSweeper NeWS game;OPEN LOOK+XView+mf-fonts FAQs;lq-text unix text retrieval
From: Goldfarb@InterRamp.com (Charles F. Goldfarb)
Newsgroups: comp.text.sgml
Subject: Re: SGML book?
Date: Fri, 24 Nov 1995 08:41:15 GMT
Organization: Information Management Consulting
Message-ID: <4940fd$anb@usenet1.interramp.com>
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ \ <1995Nov20.045420.29244@sq.com>
Reply-To: Goldfarb@InterRamp.com
lee@sq.com (Liam R. E. Quin) wrote:
I would appreciate hearing about any errors in README.1ST or other
books in my series as I am not aware of any. Please write to
Goldfarb@SGMLSource.com.
Thanks,
Charles Goldfarb
Newsgroups: comp.text.sgml
Date: 24 Nov 1995 12:39:22 +0100
From: Stéphane Bornnerand \
Message-ID: <199511241139.MAA22270@cygne.ais.berger-levrault.fr>
References: <47togkINNoc1@oasys.dt.navy.mil>
Subject: Re: SGML Document Management System
Reply to the following article based on the product ADOC(TM) developed both
by AIS S.A. and the AVIONICS and SYSTEMS Division of AEROSPATIALE (France).
I directly put the answers to your questions within the text and additional
comments at the end of the message.
[Lori Westbrook]
| Does anyone know of any software that could be used to build an SGML
| document management system for technical manuals that satisfies the
| requirements listed below. If you know of any products that satisfy all
| or a majority of these requirements please respond by e-mail to
| "sgmldbms@oasys.dt.navy.mil". The desired system must:
| 1. maintain structure of an SGML instance.
ADOC is based on SGML/Store(TM) storage sub-system which has been designed
to store and to manipulate SGML document or collection of documents under a
database format. SGML/Store (and so ADOC) accepts arbitrary DTDs, without
requiring any specific database schema definition or settings step. A
SGML/Store (and so ADOC) database can manage heterogeneous document
collections which are instance of multiple DTDs.
| 2. maintain multiple versions of the same document without redundancy
| of data storage.
Yes.
The manager of the database is open to define the granularity of the SGML
storage management. The manager declares how an SGML document or an SGML
fragment will be managed (access rights, versioning).
Furthermore, ADOC makes distinction between "evolution" and version. A
version is a document or a collection of documents with a number and it
could no more be updated. The "evolution" keeps tracking of different
steps. ADOC maintains the database consistency and the last version is
always used through cross-references.
| 3. maintain/manage binary objects, e.g, images, ASCII text.
ADOC is able to manage both SGML instances database and illustration
database (graphics).
It is possible to store/retrieve/access also other binary objects (WORD,
EXCEL,...) in the database using external application for editing.
ADOC 2.0 will provide versioning on graphics. In ADOC 1.2, the versioning
of a graphics is linked with the SGML fragment where the graphics is
included.
| 4. provide configuration management to include
| revision management and workflow management.
ADOC 1.2 provides configration management with revision management but no
workflow management.
ADOC 2.0 will provide user-defined properties and APIs in order to be able
to easily integrate a workflow product within ADOC.
| 5. support multi-platform development and shall provide a standard
| Application Programming Interface.
ADOC 1.2 provide an SGML automatic transformation engine in order to
execute scripts to modify SGML instances.
ADOC 2.0 will provide a set of APIs and a toolkit for developing specific
gateway , customizing ADOC environment or adding new functions with plug-in
modules.
| 6. provide for client/server interfaces across platforms e.g., NT,
| Unix/Sun, Windows-based, etc.
ADOC 1.2 is client/server but it requires to mount NFS. It is running on
SUN OS.
ADOC 2.0 will be client/server with RPC on TCP. Alpha release is running on
Solaris 2. A windows-based client is planned.
| 7. be browser independent.
ADOC is editor independent but it requires a gateway to plug-in the editor
onto the ADOC workbench. In the release 1.2, ADOC provides a gateway only
for Author/Editor from SoftQuad.
| 8. export SGML files directly from the database
| for use in independent software products.
Yes. Export of one document or export of a collection of documents.
Furthermore, ADOC is able to mark the delta between the version n-i and n-j
of a document during the export process.
| 9. support all plug-ins via API, an example would be an enhanced
| graphics viewer.
ADOC 1.2 provides APIs to plugs external tools for editing or viewing.
| 10. efficiently manage 30,000 to 40,000 documents, each averaging 3MB
| of text and 17MB of graphics.
ADOC effeciently supports very large documents of several tens of Mbytes in
size, made up of several million SGML elements.
ADOC offers concurrent editing where 10 users can have concurrent access to
the same document.
ADOC 1.2 (Available)
- Documentation under CALS standards: SGML, CGM & CCITT G4
- DTD structuration: arbitrary DTD, definition of SGML elements granularity
level
- Parsing SGML instance according to its DTD
- SGML transformation engine for scipting
- Author private working space. ADOC is native concurrent editing system
- Integrated native SGML editor
- Integrated Graphic CGM editor
- Integrated Group 4 graphic editor
- Formatted manuals database
- OS CALS A & B composer with incremental formatting
- Viewer/differential viewer
- Printer/differential printer (Postscript)
- Electronic publications maker
ADOC 2.0 (2Q96)
New features
- customization of the client application
- Client/server architecture (management, import/export, printing, viewing)
- Document server based on a RDBMS
- Configuration server is based on a RDBMS
- SGML Entity management in SGML/Store (and so in ADOC)
- Generic declaration at a Class level (DTD, FOSI, filter, access rights,
granularity level,...)
- Security refinement
- APIs
For further information, contact me or Isabelle Bornier:
phone: [+33] 1 46 40 84 15
FAX: [+33] 1 46 40 84 10
email: ibor@ais.berger-levrault.fr
Regards,
Stephane Bornerand.
--
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
_/ Stephane Bornerand phone: [+33] 1 46 40 84 06 _/
_/ AIS S.A. FAX: [+33] 1 46 40 84 10 _/
_/ 35, Rue du Pont email: sbor@ais.berger-levrault.fr _/
_/ 92200, Neuilly, FRANCE WWW: http://ais.berger-levrault.fr _/
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
From: taube@isa.de (Christian Taube)
Newsgroups: comp.text.sgml
Subject: Re: Mapping RTF hex codes to ISO Latin entities?
Date: 24 Nov 1995 12:27:02 +0100
Organization: ISA GmbH, Azenbergstr. 35, D-70174 Stuttgart, FRG
Message-ID: <494a66$qas@dirac.isa.de>
References: <492g3n$p9d@dirac.isa.de>
Keywords: RTF, ISO Latin, mapping, character sets
I wrote yeserday:
>I'm looking for some sort of table that lists all the RTF hexadecimal
>codes -- the ones that look like \\'\ and are replaced by special
>characters and "Umlaute" according to The Microsoft Way when you
>display the document -- and their corresponding ISO 8859 entities.
>(No, I don't regularly work with RTF ... :-)
It has been pointed out to me that this is quite obvious from the fact
that I didn't indicate which version of RTF and which word processor I
am working with ... (God I'm starting to hate RTF, don't you?). OK,
so it's the 1.4 spec of RTF that I have here, and I need the map
primarily for a German MS Word 6, but English would be nice, too.
In the meantime, I'm trying to transcribe those stupid RTF characters
manually and find the entities, and I have stopped wondering why the
rtftohtml filters that I have seen did mostly a bad job ... :-)
Christian.
--
Christian Taube -- \taube@isa.de\
Any opinions expressed are mine, and mine only! Thought for today:
These are the children of the race "Enterprise." Her 5-year mission - to
logically go where no Romulan has gone before.
From: Paul Treadaway \
Newsgroups: comp.text.sgml
Subject: Re: SGML -> HTML
Date: Fri, 24 Nov 95 12:05:54 GMT
Organization: Cambridge University Press
Message-ID: <199511241214.MAA05230@sun1.cup.cam.ac.uk>
References: <48g54e$8d3@news.INbe.net> <48i0qh$16e@news.Belgium.EU.net> <48jgka$fb5@ixnews2.ix.netcom.com>
In-Reply-To: <48jgka$fb5@ixnews2.ix.netcom.com>
In article <48jgka$fb5@ixnews2.ix.netcom.com>, Mark Klamerus says...
>Jacques Deseyne \ wrote:
>>"M.T. Carrasco Benitez" \ wrote:
>>>I am looking for SGML->HTML filter, or a pointer to the FAQ where this
>>>kind of answer could be. Mail prefered.
>Actually making the required filters in UNIX scripts is very, very
>easy. There is no "generic" filter that I know of since every DTD is
>different, but if you're looking for a "tool" you might consider
>Exoterica, which is excellent.
Exoterica is the name of a company. I assume you're referring to one
of their products (e.g. Omnimark, their scripting language for SGML
conversions). Omnimark has a 4GL style interface which drives a fairly
brute force processing model, but with some nice extras such as back-
patching for more complicated tasks. Another option is AIS's Balise,
which has a more conventional interface (C++ style syntax) but comes
with a library of translation functions which operate by directly
manipulating an ESIS tree.
Newsgroups: comp.text.sgml
Date: Fri, 24 Nov 1995 14:54:02 +0100
From: Stéphane Bornnerand \
Message-ID: <199511241354.OAA25691@cygne.ais.berger-levrault.fr>
References: <47tp7u$p2j@dub-news-svc-2.compuserve.com> \
Subject: Re: SGML ----> RTF
[Jeff Webster]
| Anyone know of a tool for converting SGML to RTF. Thanks in advance.
[Alfie Kirkpatrick]
| I don't think you'll find an off-the-shelf tool for an arbitrary DTD.
| Tools like Omnimark make fairly light work of it though, and the RTF
| spec is reasonably well documented (available from the MS Web site,
| http://www.microsoft.com).
There are two ways to proceed for converting SGML to RTF.
The first one is as you proposed is to use a product such Omnimark or
Balise to convert a SGML file into a RTF file. This tool works with the
format. It is transform a representation into another. The problem here is
first that SGML instance is dependent on a DTD and, second, SGML does not
include all the data requires by RTF. You should overcome this gap by
additional programming or declaration (or both).
Note: Balise(TM) is an SGML document processing language developed and
shipping by AIS.
The second way is to use a Page-composition gateway to transform an SGML
instance into an RTF file by composing and formatting SGML instance in
order to generate RTF (or other formats).
AIS is developing such a product called DynaPage(TM).
How does it work ?
- take an SGML instance and its DTD
- create a stylesheet from the DTD in order to define the value of the
attributes of RTF styles
- compile SGML instance, DTD, and Stylesheet
- generate RTF
Advantages
- Never require programming
- Interactive stylesheet editor (simple text editor could be enough)
- Batch conversion
- Reliable process
- Target RTF styles customisation
For further information, contact me or Isabelle Bornier:
phone: [+33] 1 46 40 84 15
FAX: [+33] 1 46 40 84 10
email: ibor@ais.berger-levrault.fr
Regards,
Stephane Bornerand.
--
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
_/ Stephane Bornerand phone: [+33] 1 46 40 84 06 _/
_/ AIS S.A. FAX: [+33] 1 46 40 84 10 _/
_/ 35, Rue du Pont email: sbor@ais.berger-levrault.fr _/
_/ 92200, Neuilly, FRANCE WWW: http://ais.berger-levrault.fr _/
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
From: Jacques Deseyne \
Newsgroups: comp.text.sgml
Subject: Re: Mapping RTF hex codes to ISO Latin entities?
Date: 24 Nov 1995 14:34:02 GMT
Organization: SEMA Group Belgium
Message-ID: <494l4q$1tq@news.Belgium.EU.net>
References: <492g3n$p9d@dirac.isa.de> <494a66$qas@dirac.isa.de>
taube@isa.de (Christian Taube) wrote:
>I wrote yeserday:
>>I'm looking for some sort of table that lists all the RTF hexadecimal
>>codes -- the ones that look like \\'\ and are replaced by special
>>characters and "Umlaute" according to The Microsoft Way when you
>>display the document -- and their corresponding ISO 8859 entities.
>
>>(No, I don't regularly work with RTF ... :-)
>
>It has been pointed out to me that this is quite obvious from the fact
>that I didn't indicate which version of RTF and which word processor I
>am working with ... (God I'm starting to hate RTF, don't you?). OK,
>so it's the 1.4 spec of RTF that I have here, and I need the map
>primarily for a German MS Word 6, but English would be nice, too.
>
I'm not sure whether the spec version makes much difference, but the
character set defined at the start of a document will. It can be "ansi",
"mac", "pc" (IBM PC code page 437) or "pca" (code page 850). With MS-Word 6,
you have probaly only the "ansi" character set in your RTF files.
The meaning may also differ according to the font which is in use.
A single symbol may even be represented by a field like
{\\field{\\*\\fldinst SYMBOL 36 \\\\f "ZapfDingbats" \\\\s 12}{\\fldrslt\\f12\\fs24}}
There are many other caveats which don't come to mind now.
Long live vendor-independent representations!
Best regards,
------------------------------------------------------
Jacques Deseyne \
Sema Group Belgium - Stallestraat 96 - B-1180 Brussels
------------------------------------------------------
Newsgroups: comp.text.sgml
From: ak@oasis.icl.co.uk (Alfie Kirkpatrick)
Subject: Re: Mapping RTF hex codes to ISO Latin entities?
Message-ID: \
Sender: news@oasis.icl.co.uk
Reply-To: ak@oasis.icl.co.uk
Organization: ICL, Bracknell, UK
References: <492g3n$p9d@dirac.isa.de>
Date: Fri, 24 Nov 1995 14:46:30 GMT
Christian Taube (taube@isa.de) wrote:
: Sorry to bug you guys again with stupid RTF stuff, but I need to work
: this out ASAP ... :-(
Sorry this isn't a definitive answer ... :-)
: I'm looking for some sort of table that lists all the RTF hexadecimal
: codes -- the ones that look like \\'\ and are replaced by special
: characters and "Umlaute" according to The Microsoft Way when you
: display the document -- and their corresponding ISO 8859 entities.
: Does anyone know if and where such a beast exists?
I don't think it will be that simple even if such a table does
exist. Remember that some of the ISO entities need a different
font in addition to a character code, so it won't be a straight
hex code to entity name mapping; you will also need to know
the font in effect when the RTF \\'xx occurs.
You could construct your own table(s) from the entity sets given
an SGML processing tool (bread and butter for Omnimark) and then
read them back in when doing the RTF conversion.
EBT have done some work on the ISO entities to define text,
decimal (ddd;) and SDATA equivalents. For the SDATA case,
the format is something like:
\
From this you can construct the right RTF sequence to change
font and output the \\'xx code.
Incidentally, does anyone know if this format for SDATA entities
is a standard/convention or something EBT devised for themselves?
Hope this helps. Converting RTF to SGML - rather you than me!
Alfie.
--
+-Professional Publishing Services---------------+
| Alfie Kirkpatrick ICL |
| external: +44 1344 472500 Lovelace Road |
| internal: 7263 2500 Bracknell, Berks |
| mail: ak@oasis.icl.co.uk RG12 8SN |
+------------------------------------------------+
Newsgroups: comp.text.sgml
From: ak@oasis.icl.co.uk (Alfie Kirkpatrick)
Subject: Re: Mapping RTF hex codes to ISO Latin entities?
Message-ID: \
Sender: news@oasis.icl.co.uk
Reply-To: ak@oasis.icl.co.uk
Organization: ICL, Bracknell, UK
References: <492g3n$p9d@dirac.isa.de>
Date: Fri, 24 Nov 1995 14:53:01 GMT
Christian Taube (taube@isa.de) wrote:
: I'm looking for some sort of table that lists all the RTF hexadecimal
: codes -- the ones that look like \\'\ and are replaced by special
: characters and "Umlaute" according to The Microsoft Way when you
: display the document -- and their corresponding ISO 8859 entities.
: Does anyone know if and where such a beast exists?
Just ignore my previous reply - I didn't read the subject line
properly (about Latin entities). Here is a file mapping the
ISOlat1 entity set to decimal characters.
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
\
--
+-Professional Publishing Services---------------+
| Alfie Kirkpatrick ICL |
| external: +44 1344 472500 Lovelace Road |
| internal: 7263 2500 Bracknell, Berks |
| mail: ak@oasis.icl.co.uk RG12 8SN |
+------------------------------------------------+
Newsgroups: comp.text.sgml
Date: 24 Nov 1995 17:27:56 +0100
From: Stéphane Bornnerand \
Message-ID: <199511241627.RAA01609@cygne.ais.berger-levrault.fr>
References: \
Subject: Re: SGML and Page Composition
[Jeanne Fritch]
| I am looking for people using SGML and still have important Page Layout
| requirements. The questions I have are:
Here, I will describe a Page-composition gateway which is able to generate
input to page makeup systems from SGML data. The page layout could then be
precisely designed and printed under these systems.
Requirements addressed by the Page-compositon gateway include: 1) a style
sheet mechanism with "propagating" styles and SGML context sensitivity; 2)
a powerful set of data manipulation functions for data filtering,
reordering, and generatione; 3) a database implementation, capable of
scaling to very large documents without any concern for document size; and
4) sufficient typographic sophistication in composition/layout algorithms
(in the page makeup system).
| 1. Describe the process from SGML to output.
Let me take FrameMaker as page makeup system.
Preparation
- take an SGML instance and its DTD
- create a stylesheet from the DTD in order to define the value of the
attributes of RTF styles
- compile SGML instance, DTD, and Stylesheet
- generate MIF
- open MIF with FrameMaker
- Enhance style definitions and define page layout under FrameMaker
- save FrameMaker stylesheet
Production (automating the process)
- compilation of SGML instance, its DTD, and its "basic" stylesheet
- generatation of MIF
- Loading MIF and its "enhanced" stylesheet under FrameMaker
and
- Printing or doing what you need !
| 2. What tools ae you using?
In order to handle the complete Page-Composition layout process from SGML
data as detailled in 1, you should use :
- DynaText for compiling SGML
- DynaPage for generating MIF file
- FrameMaker for composing and printing
AIS is developing the product DynaPage(TM). DynaPage is a compagnon product
of DynaText from EBT.
| 3. Is it a batch process or does a lot of work need to be done on
| the information at publishing time.
Yes. it is a batch process on production (at publishing time !).
| 4. What was the process of setting up the environment?
Please explain.
| 5. Do you deal with A-pages and how?
If the question is: Do you deal with A3, A4, ... ?
The answer is: Yes.
How: under FrameMaker.
| 6. Tips you can suggest for someone starting the setup process.
If I can answer to 4 than I will be able to suggest tips.
A Page-composition gateway based on DynaPage(TM) product could generate
input to popular document composition systems, including FrameMaker(TM),
Interleaf(TM) and MS-Word(TM).
DynaPage is running under Unix.
I can send by E-mail the electronic form of the DynaPage documentation on
request. Don't hesitate to ask for.
For further information, contact me or Isabelle Bornier:
phone: [+33] 1 46 40 84 15
FAX: [+33] 1 46 40 84 10
email: ibor@ais.berger-levrault.fr
Regards,
Stephane Bornerand.
--
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
_/ Stephane Bornerand phone: [+33] 1 46 40 84 06 _/
_/ AIS S.A. FAX: [+33] 1 46 40 84 10 _/
_/ 35, Rue du Pont email: sbor@ais.berger-levrault.fr _/
_/ 92200, Neuilly, FRANCE WWW: http://ais.berger-levrault.fr _/
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
From: voodoo@mulberry.com (Dennis Stein)
Newsgroups: comp.text.sgml
Subject: {{{{{{{{{{{ Your Web Page! }}}}}}}}}}}
Date: Fri, 24 Nov 1995 19:02:30 GMT
Organization: Uniquest On-Line Services
Message-ID: <495306$3g8@news.mulberry.com>
Send us an email with your rough ideas of what you want on a web page,
and we'll e-mail you a sample page of what your business will look
like on the WWW.
We also do Graphic Design, Data Recovery, Searches, Tourguiding, etc.
all by E-mail!!!
Send E-mail to:
voodoo@mulberry.com
HyperNet Information Systems
From: voodoo@mulberry.com (Dennis Stein)
Newsgroups: comp.text.sgml
Subject: {{{{{{{{{{{ Your Web Page! }}}}}}}}}}}
Date: Fri, 24 Nov 1995 19:05:33 GMT
Organization: Uniquest On-Line Services
Message-ID: <49535v$3g8@news.mulberry.com>
Send us an email with your rough ideas of what you want on a web page,
and we'll e-mail you a sample page of what your business will look
like on the WWW.
We also do Graphic Design, Data Recovery, Searches, Tourguiding, etc.
all by E-mail!!!
Send E-mail to:
voodoo@mulberry.com
HyperNet Information Systems
Newsgroups: comp.text.sgml
From: lee@sq.com (Liam R. E. Quin)
Subject: Re: SGML book?
Message-ID: <1995Nov24.181139.24412@sq.com>
Organization: SoftQuad Inc., Toronto, Canada
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ <1995Nov20.045420.29244@sq.com> <4940fd$anb@usenet1.interramp.com>
Date: Fri, 24 Nov 1995 18:11:39 GMT
Charles F. Goldfarb \ wrote:
>lee@sq.com (Liam R. E. Quin) wrote:
> I would appreciate hearing about any errors in README.1ST or other
> books in my series as I am not aware of any.
E.g. saying that \ is a paragraph separator in HTML.
The whole HTML section at the back is based on a draft DTD dated from 1993
that was a discussion piece, and is not the correct DTD for HTML.
(I realise that RFC1866 wasn't published in time for the book, of course)
I didn't find any errors in the SGML part (and didn't claim to) -- but I
have only read the HTML part in detail. I do think most of what I have
read is well-written, but being two years out of date when you're writing
about the Word Wide Web is not a good idea :-(
Actually, I should probably praise README.1ST for including anything
at all about the Web -- Travis and Waldt included much less in their book,
and (I think misleadingly) called HTML an `sgml-like language' [p.55] and
also says `HTML is not really SGML' [p.56].
But I don't want to get into details of the content -- my earlier article
(as it said) was about first impressions, and I wrote it partly to alert
people to more SGML books, and partly to try & get some discussion going.
> Please write to Goldfarb@SGMLSource.com.
(I've cc'd that mail address)
Lee
--
Liam Quin, SoftQuad Inc +1 416 239 4801 lee@sq.com \
HexSweeper NeWS game;OPEN LOOK+XView+mf-fonts FAQs;lq-text unix text retrieval
From: Paul Madsen \
Newsgroups: comp.text.sgml
Subject: CALS DTD table attributes
Date: 24 Nov 1995 18:31:59 GMT
Organization: Newbridge Networks Corporation
Message-ID: <49532v$it1@kannews.ca.newbridge.com>
I'd like to get some info on the CALS table (and its subelements) attributes.
Is there an archive defining the purpose of each of the attributes somewhere?
Thnaks for any help
Paul Madsen
Online Documentation
Newbridge Networks Corp
From: jang@xs4all.nl (Jan Grootenhuis)
Newsgroups: comp.text.sgml
Subject: Re: HyTime and OpenDoc and OLE
Date: 25 Nov 1995 15:00:04 GMT
Organization: XS4ALL, networking for the masses
Message-ID: <497b1k$21j@news.xs4all.nl>
References: \
Keywords: HyTime,OpenDoc,OLE
rs@cci.com (Rick Silterra) writes:
>I have been asked the question: How do HyTime, and OpenDoc relate?
I propose you read Liora Alschuler's beautiful book:
ABCD... SGML, a user's guide to structured information,
International Thomson Computer Press, ISBN 1-850-32197-3
It compares SGML-WYSIWYG-DTP-Databases-GML-ODA-Opendoc-OLE-RTF-PostScript-
PDF-HTML-HyTime on p. 31-38.
Enjoy!
--
Jan Grootenhuis Kralenbeek 1873 1104 KJ AMSTERDAM The Netherlands
Tel/fax (+31)20-699.89.66 Internet jang@xs4all.nl
From: jang@xs4all.nl (Jan Grootenhuis)
Newsgroups: comp.text.sgml
Subject: Re: Mapping RTF hex codes to ISO Latin entities?
Date: 25 Nov 1995 15:08:48 GMT
Organization: XS4ALL, networking for the masses
Message-ID: <497bi1$29p@news.xs4all.nl>
References: <492g3n$p9d@dirac.isa.de> <494a66$qas@dirac.isa.de>
Keywords: RTF, ISO Latin, mapping, character sets
taube@isa.de (Christian Taube) writes:
>so it's the 1.4 spec of RTF that I have here, and I need the map
>primarily for a German MS Word 6, but English would be nice, too.
Did you check your local MicroSoft BBS? They normally that sort of stuff.
Tschuess,
--
Jan Grootenhuis Kralenbeek 1873 1104 KJ AMSTERDAM The Netherlands
Tel/fax (+31)20-699.89.66 Internet jang@xs4all.nl
From: tore@lis.pitt.edu (Tore Joergensen)
Newsgroups: kaikow@standards.com,comp.text.sgml
Subject: Re: SGML book?
Date: 25 Nov 1995 18:39:32 GMT
Organization: University of Pittsburgh
Message-ID: <497nt4$eem@toads.pgh.pa.us>
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ \
kaikow@standards.com wrote:
: Is the Alschuler book better than the Practical SGML book by Van
: Herwijejnen? If so, how?
: In article \, Peter Flynn \ wrote:
: >Dixit pyachnes@ucs.indiana.edu (paul alan yachnes):
: >> Can someone recommend the best book for learning SGML?
: >The best one I can recommend is "ABCD... SGML: A User's Guide
: >to Structured Information" by Liora Alschuler (ISBN: 1-850-32197-3)
"ABCD...SGML" is not a book for learning SGML. The explanation of the
SGML syntax is very brief. This book tries to explain the big picture,
rather than to learn you how to write SGML.
"Practical SGML" by Eric van Herwijnen is a book for learning SGML,
but in my opinion not a very good one. It is to much like a reference
manual... 'This is everything you can do with this,... now let's see
on the next structure in the language'. I don't think the explanation
of everything is as good as it should be either.
"The SGML Handbook" by Charles F. Goldfarb is a very complete book.
I don't know how it is to learn from, but it is very good as a
reference manual. My impression (after reading a few sections :-)
is that it probably is easier to learn from, than "Practical SGML".
"The Complete SGML"-CD from Exoterica is the ISO-standard with some
explanations, and many example documents. In my opinion the interface
sucks. I would not try to learn from this.
There are a few other books, and SoftQuad have a cheap CD, but I haven't
seen any of them and can of course not comment on them.
--
+-------------------------+-------------------------------------------+
| Tore B. Joergensen | e-mail : tore@lis.pitt.edu |
| Centre Court Villa | web : http://www.pitt.edu/~tojst1 |
| 5535 Centre Avenue # 6 | |
| Pgh, PA 15232, USA | Norwegian MSIS-student at Univ. of Pgh. |
+-------------------------+-------------------------------------------+
From: jmcd@info.sims.berkeley.edu (Jerry McDonough)
Newsgroups: comp.text.sgml
Subject: Content Model Blues
Date: 25 Nov 1995 21:26:07 GMT
Organization: School of Information Management and Systems, UC Berkeley
Message-ID: <4981lf$qcj@agate.berkeley.edu>
I've got what I thought was a fairly simple content model problem,
that's turning uglier every time I look at it. I suspect I'm missing
something fundamental here. If anyone can shed some enlightment
on me, I'd appreciate it.
The Problem:
I've got an element (let's call it HAIRY) consisting of three
subelements, a, b, & c. Element a is optional, and may only occur once
(e.g., a?). Element b is mandatory and may only occur once (e.g, b), and
element c is optional and may occur as many times as it wants (e.g, c*).
The subelements may occur in any order they want, so the following:
\
\data\\data\\data\\data\\data\\data\
\
should be perfectly valid.
Getting the constraints I want on the repeatability of the subelements
is pretty straightforward if I impose a specific order on them, like
\
but how do I get the constraints on repeatability *without* restricting
the order in any fashion?
Thankful for any help I can get,
Jerry McDonough -- jmcd@info.SIMS.Berkeley.EDU | (......)
UCB School of Information Management & Systems | \\ * * /
| \\ <> /
"Don't worry. I know what I'm doing!" | \\ -- / SGNORMPF!!!
-- From the Famous Last Words file | ||||
From: bhoffman@cerfnet.com
Newsgroups: comp.text.sgml
Subject: SGML Contracts/Perm. Jobs
Date: Sat, 25 Nov 95 10:33:48 PDT
Organization: CERFnet
Message-ID: \
Greetings -
I have several career opportunities for SGML experts.
My client is seeking several talented, skilled
individuals to join a very dynamic, SGML-based
publishing team. On the contract side, we are
interested in individuals who can provide
expertise in use of Electronic Book Technologies (EBT)
tools. On the permanent side, we are looking for
Content Management and Production Management people.
For more information and consideration, please contact me:
netWorkers, inc.
415 254-2350
- Bob Susskind
From: pyachnes@ucs.indiana.edu (paul alan yachnes)
Newsgroups: kaikow@standards.com,comp.text.sgml
Subject: Re: SGML book?
Followup-To: kaikow@standards.com,comp.text.sgml
Date: 26 Nov 1995 16:26:57 GMT
Organization: Indiana University
Message-ID: <49a4gh$idn@usenet.ucs.indiana.edu>
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ \ <497nt4$eem@toads.pgh.pa.us>
I began this thread way back when, before I had done any research on the
subject. Subsequently, I discovered theat QUE will be publishing "Using
SGML" at the end of January (listed in Books in Print as Oct. '95). Since
I was looking for a "how to" book for the purpose of self-instruction, I
look forward to seeing this. The books in this series are consistently
pretty good. I thought others might be interested.
Paul Yachnes
From: liora@delphi.com
Newsgroups: comp.text.sgml
Subject: Re: SGML book?
Date: Sun, 26 Nov 95 11:55:22 -0500
Organization: Delphi (info@delphi.com email, 800-695-4005 voice)
Message-ID: \
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ <1995Nov20.045420.29244@sq.com> <1995Nov20.214942.10037@sq.com> \
\ writes:
>I agree, but a few bookstores stock SGML books. Others will get copies of
>most books for one to look at with no obligation to buy, however, most
>will only do that if the book is available thru their supplier. Alas,
>Practical SGML is not so available.
Excerpts from ABCD... SGML are available at:
http://www.thomson.com/itcp/liora/LA_HOME.HTML
These are in HTML. There will soon be excerpts (the entire Resource Guide)
available in SGML as a WebExtra at www.thomson.com/itcp/itcp.html
Please sample.
Liora
From: jbw@bigbird.bu.edu (Joe Wells)
Newsgroups: comp.text.sgml
Subject: Re: Content Model Blues
Date: 26 Nov 95 19:47:24 GMT
Organization: Boston University Computer Science Department
Message-ID: \
References: <4981lf$qcj@agate.berkeley.edu>
In-reply-to: jmcd@info.sims.berkeley.edu's message of 25 Nov 1995 21:26:07 GMT
In article <4981lf$qcj@agate.berkeley.edu> jmcd@info.sims.berkeley.edu (Jerry McDonough) writes:
I've got an element (let's call it HAIRY) consisting of three
subelements, a, b, & c. Element a is optional, and may only occur once
(e.g., a?). Element b is mandatory and may only occur once (e.g, b), and
element c is optional and may occur as many times as it wants (e.g, c*).
The subelements may occur in any order they want, so the following:
\
\data\\data\\data\\data\\data\\data\
\
should be perfectly valid.
Hmm, nice puzzle. The elements can occur in these numbers in any order:
a: 0 or 1
b: 1
c: 0, 1, 2, ...
If elements a, b, and c have declared content CDATA, RCDATA, or EMPTY,
then we can just do this and be done:
\
However, using an inclusion exception to allow element c to appear
anywhere can screw you up if one of the elements a, b, or c allow included
elements to appear. You could get around this by having exclusion
exceptions for a, b, and c, but this can be a problem as well.
The more general solution goes as follows. The way the elements can
appear is described by this state machine:
_c_ _c_
/ \\ / \\
| | | |
\\ V V /
--a0,b0 ---b---> a0,b1--
| |
| |
a a
| |
| |
V V
--a1,b0 ---b---> a1,b1--
/ ^ ^ \\
| | | |
\\ / \\ /
-c- -c-
I've given the states names that indicate how many a or b elements have
been seen so far. The state "a0,b0" is the start state and the states
with "b1" in their name are accept states.
That corresponds to this regular expression:
c*((ac*bc*)|(bc*(ac*)?)
I'll need to play with this a bit to put it in SGML syntax. This should
be a valid content model for what you want:
(c*, ((a, c*, b, c*) | (b, c*, (a, c*)?)))
By the way, for your amusement, the proposed DTD for HTML 3.0 faced the
same issue and handled it wrong. As a result, the following is not valid
HTML 3.0:
\
\
\
\ Here is the title. \
\
\
\
\ Here is the body. \
\
\
In other words, you can't have a META element both before and after a
TITLE element. This contradicts the prose of the proposal, which states
that the content of HEAD can appear in any order. (This doesn't really
matter since the HTML 3.0 proposal has expired.)
The HTML 2.0 DTD botched the handling of this issue in a different way.
The following is *valid* by the HTML 2.0 DTD:
\
\
\
Here is
\
the title.
\
\
\
\ Here is the body. \
\
\
This is bad because the TITLE element is supposed to contain only data
characters. It is not clear to me why TITLE was not given a declared
content of RCDATA to avoid this problem.
--
Enjoy,
Joe Wells \
Newsgroups: comp.text.sgml
From: lee@sq.com (Liam R. E. Quin)
Subject: Re: Content Model Blues
Message-ID: <1995Nov26.210607.26126@sq.com>
Organization: SoftQuad Inc., Toronto, Canada
References: <4981lf$qcj@agate.berkeley.edu> \
Date: Sun, 26 Nov 1995 21:06:07 GMT
Joe Wells \ wrote:
[...]
>The HTML 2.0 DTD botched the handling of this issue in a different way.
>The following is *valid* by the HTML 2.0 DTD:
>
> \
> \
> \
> Here is
> \
> the title.
> \
> \
> \
> \ Here is the body. \
> \
> \
There were supposed to be exclusions on TITLE, but they got left off
by mistake in the draft.
> It is not clear to me why TITLE was not given a declared
> content of RCDATA to avoid this problem.
Where were you when the HTML Working Group was dealing with this? :-)
There is no good reason, and it's better than the method that I proposed
(using exclusions) that Dan left off. Actually I thought this was fixed
in the final DTD, but I don't think it matters too much really.
There are some people on the WG who'd rather not deal with CDATA
or RCDATA elements. Once you have EMPTY elements, I'm not sure why it
makes a difference, since you now have an irregular syntax anyway.
The earlier versions were either ambiguous, or disallowed
\
\....
\....
\...
\
and since Internet Assistant creates HTML like this (the second META is
an advert for the product, as I recall), it had to be fixed fairly quickly.
Lee
--
Liam Quin, SoftQuad Inc +1 416 239 4801 lee@sq.com \
HexSweeper NeWS game;OPEN LOOK+XView+mf-fonts FAQs;lq-text unix text retrieval
Newsgroups: comp.text.sgml
From: paul@arbortext.com (Paul Grosso)
Subject: Re: CALS DTD table attributes
Message-ID: <1995Nov26.011104.21915@arbortext.com>
Sender: news@arbortext.com
Organization: ArborText, Inc. 1000 Victors Way Ann Arbor, Michigan 48108
Date: Sun, 26 Nov 1995 01:11:04 GMT
> From: Paul Madsen \
> Subject: CALS DTD table attributes
> Date: 24 Nov 1995 18:31:59 GMT
>
> I'd like to get some info on the CALS table (and its subelements) attributes.
> Is there an archive defining the purpose of each of the attributes somewhere?
SGML Open has written a Technical Memorandum about the CALS Table DTD
including a discussion of the semantics of elements, attributes, and
attribute values. Though not available to non-members during drafting
and discussion, once completed (as this one is), SGML Open Technical
Committee documents are generally available upon request. We are in
the process of making such output available in various electronic
forms on various servers. At the present time, the document you would
want, TM 9502:1995, is available in hardcopy (and, perhaps, as we
speak, in varous electronic forms) upon request to the SGML Open
Executive Director, Mary Laplante at laplante@sgmlopen.org.
The Entity Management (aka Catalog) Technical Resolution TR 9401:1995
is also similarly available. When I have the online details, I will
make a more formal announcement.
paul
Paul Grosso
VP Research, ArborText, Inc. and Chief Technical Officer, SGML Open
Email: paul@arbortext.com
--
-bill
From: connolly@w3.org (Dan Connolly)
Newsgroups: comp.text.sgml
Subject: Re: Content Model Blues
Date: 26 Nov 1995 21:24:34 GMT
Organization: Massachvsetts Institvte of Technology
Message-ID: \
References: <4981lf$qcj@agate.berkeley.edu> \
In-reply-to: jbw@bigbird.bu.edu's message of 26 Nov 95 19:47:24 GMT
In article \ jbw@bigbird.bu.edu (Joe Wells) writes:
>
> The HTML 2.0 DTD botched the handling of this issue in a different way.
> The following is *valid* by the HTML 2.0 DTD:
>
> \
> \
> \
> Here is
> \
> the title.
> \
> \
> \
> \ Here is the body. \
> \
> \
>
> This is bad because the TITLE element is supposed to contain only data
> characters.
This was fixed late in the game, but it was fixed. The TITLE element
has an exclusion exception for META and LINK.
It's kinda kludgy. I need to study up on inclusions/exclusions and get
it right for the next rev. Some FORMs related stuff is done with
inclusion exceptions in ways that lead to strange situations.
Running nsgmls on the above instance yields:
connolly@beach ../connolly[692] html-validate test.html
test.html:6:39:E: end tag for `TITLE' omitted, but its declaration does not permit this
test.html:4:7: start tag was here
test.html:8:14:E: end tag for element `TITLE' which is not open
Be sure you have the right version of html.dtd:
$Id: html.dtd,v 1.30 1995/09/21 23:30:19 connolly Exp $
Dan
--
Daniel W. Connolly "We believe in the interconnectedness of all things"
Research Scientist, MIT/W3C PGP: EDF8 A8E4 F3BB 0F3C FD1B 7BE0 716C FF21
\ http://www.w3.org/hypertext/WWW/People/Connolly/
From: kimber@passage.com (W. Eliot Kimber)
Newsgroups: comp.text.sgml
Subject: Re: HyTime and OpenDoc and OLE
Date: 26 Nov 1995 23:09:21 GMT
Organization: Passage Systems, Inc.
Message-ID: <49as31$8vp@pubxfer4.news.psi.net>
References: \
Keywords: HyTime,OpenDoc,OLE
In article \, rs@cci.com says...
>
>I have been asked the question: How do HyTime, and OpenDoc relate?
>I was somewhat flummoxed. My first reaction is that HyTime is
>1) HyTime is an interchange format, which describes links and time and space.
True.
>2) OpenDoc/OLE is a presentation format which describes the
>display of information - it is the multi-media equivalent of a wysiwyg editor.
Close, but not entirely accurate. Although I don't claim to be an authority on
these technologies, as I understand them, both OLE and OpenDoc provide
the following distinct technologies:
A. Inter-application APIs for exchanging data and for allowing one application
to intercommunicate with another, e.g., put a graphic editing window "inside"
a word processor window or put a video viewer inside a text browser.
B. Data storage mechanisms that optimize the storage and retrieval of
media objects and application data.
>3) A HyTime document might serve as an interchange format for an OpenDoc
> document.
This is more or less my take. Specifically, the HyTime parts of SGML documents
describe the *relationships* between objects. These descriptions are purely
semantic and not programatic or presentational (at least not to the degree
that is normally needed to present things), in exactly the same way that
SGML elements are descriptive and semantic and, normally, largely divorced
from the details of presentation or processing. This is why no SGML document
is complete without a complementary description of how to processes it,
i.e., a "style sheet".
OpenDoc or OLE, acting at the application level, can be ways of expressing
or *implementing* the relationships represented by HyTime hyperlinks or
event schedules, as well as more basic relationships defined by SGML itself,
such as references to data entities.
For example, consider the typical SGML practice of "including" graphics by
declaring an element type that has an ENTITY attribute intended to refer to
data entities containing graphics, e.g.:
\
\
An instance might look like this:
\
\
]>
\
...
\
...
\
The AVO element has established a relationship between the document
and the video object "vid-a.avi". However, nothing in the document itself
says what to do with this relationship or how to express it. The most
we know is the data content notation of the object, namely AVI video.
However, to present this, it would make perfect sense for an editor or
browser application to use OLE or OpenDoc objects and services to integrate
the AVI viewer with the browser or editor application. This is a
satisfactory and useful expression of the general relationship described
by the SGML as interpreted by the processing application. The use of OLE
is in no way at odds with the requirements or principles of SGML.
You could apply the same technique to HyTime hyperlinks. For example, I
might have an OLE object that can take the markup for a hyperlink and
provide the appropriate resolution and user interface functions needed
to express the relationship defined by the hyperlink.
A bigger problem is the storage side of OLE and OpenDoc. SGML (and by
extension, HyTime) makes a clear separation between the storage, data,
and presentation layers of a system. However, both OLE and OpenDoc collapse
the data and storage layers and tie them to the presentation layer. You can use
OLE and OpenDoc without using their storage services, but if you do so,
you run the risk of losing the all-important separation of the data layer
that SGML defines (and from which most of its benefits accrue).
HyTime attempts to resolve this issue by defining a general storage
manager mechanism that lets you declare entities that are stored as
OpenDoc bento containers or whatever the corresponding thing is in OLE
(or any other storage manager you might want to use). This provides a clear
and formal way to recover the separation between storage, data, and
presentation while still providing ways to take advantage of the
optimizations these storage mechanisms provide.
One thing to keep in mind when discussing the relationship of object-oriented
technologies to SGML and related application architectures is that SGML
is *data centric* while most (if not all) object-oriented disciplines
are *program centric*. This difference in emphasis can lead to subtle but
serious misunderstandings or mistakes.
For example, in object-oriented programming, objects have methods and data,
where the methods are generally fixed properties and the data varies. In other
words, an object tightly binds its data to its methods. However, in SGML,
we expect objects (documents, elements, and data entities) to be loosely
bound to their methods (style or processing) so that we can change the
methods applied at will. In SGML, while the relationship between the data
and the methods is the same as it is in an object-oriented system, SGML's
definition of the relationship will be more indirect, while in an object-
oriented system, the relationship is very direct (in fact, the objects
might simply be direct images of the C data structures used to represent
the objects inside the program).
In other words, in a data centric world, the data is immutable and the
methods change, while in a processing-centric work, the processing is
immutable and the data changes.
This difference in view reflects the interests of the people using the tools.
SGML is a tool of information owners and they therefore see the world from
a data centric view. Object-oriented programming is a tool of programmers
who own algorithms and they therefore see the world from a processing-centric
view. However, we know from experience that it is data that generally has the
greater long-term value and longevity and therefore tends to demand and
reward a data-centric view over the long term.
Thus, if you consider yourself a data-centric person (and I presume you are
if you are embracing SGML as a useful technology), then you should ask the
following question when evaluating any particular implementation or enabling
technology: "to what degree does this technology wrest from me, the data
owner, my rights to define the nature of my data and control its destiny."
For OLE and OpenDoc, the answer to this question depends on how you actually
use the technology, and can, as I've outlined above, be "not at all" but can
also be "almost completely". Of course, there will always be practical reasons
to give up some control over your data to those who supply the tools for working
with it, but the existance and nature of SGML implies, and I certainly feel,
that attempts by tools to usurp our rights to control our own data should be
resisted to the greatest degree possible.
As Tim Bray pointed out in his marvelous talk at SGML '94, the MIS world
learned this lesson for databases 20 years ago. When will the rest of the
world?
--
\
W. Eliot Kimber (kimber@passage.com) Systems Analyst and HyTime Consultant
Passage Systems, Inc., 2608 Pinewood Terr., Austin TX 78757 (512)339-1400
10596 N. Tantau Ave, Cupertino CA, 95014, (408) 366-0300
"SGML or die"
\
Newsgroups: comp.text.sgml
From: mlt@netcom.com (Marcy Thompson)
Subject: Re: Program to take SGML source and produce a DTD
Message-ID: \
Organization: Netcom Online Communications Services (408-241-9760 login: guest)
References: <48ihmj$bge@newsbf02.news.aol.com> <48nd2s$25j@news2.delphi.com>
Date: Sun, 26 Nov 1995 21:09:12 GMT
Sender: mlt@netcom8.netcom.com
Jeffrey McArthur writes:
>This really does not differ much from what *should* be done during document
>analysis. When you sit down to develop a DTD for a set of legacy documents
>you have to do the analysis of what is actually in the documents. You
>could tag the documents at the same time you do the analysis. You need to
>keep track of what tags are used where, putting the tags into the sample
>documents at the same time keeps track of that for you.
No no no. This description of "what *should* be done during document
analysis" is *way* too common (it appears in at least two of the three
new books on SGML, for example).
Here's my take on what should happen when you are ready to develop a
DTD for an existing set of documents:
0. You better know why you are doing this, but assume you do
1. Examine the documents. Determine every kind of structure and
content which you must label.
2. Generate some possible information architectures which provide
the structural and content labels you require.
3. Analyze these candidates in light of your requirements and select
one.
If you start tagging as you go along, you will discover that you've
built into the tags you used a bunch of assumptions, some of which
might be useful but some of which are certain not to be.
Do not be deceived into believing that document analysis is the act of
"recording what is there". It is the act of expanding your vision to
see what is there and what is not and then creatively designing an
architecture that supports both your notions of what is there and
your ideas of what is not there. Designing a DTD is a profoundly
creative act, and if you approach it as the task of "tagging what
I see", you will get the wrong DTD, I promise.
For samples of what I mean, I refer y'all to three sources:
(1) The Pinnacles DTDs, developed by Atlis Consulting Group (working
with the Pinnalces companies) contain a feature called "source and
reflection". Anyone who tried to write a DTD by tagging as he went
along would *never* have come up with this.
(2) My collegue, Kate Hamilton, will be doing a poster at the SGML
'95 conference the week after next wherein she will discuss alternate
tagging schemes, which is really what I'm talking about here. I don't
know the title of her poster, and I'm pretty sure that the poster does
not yet exist in an on-line form.
(3) The IBMIDDOC architecture is full of things that are the result of
careful and deep thought about what information is, how it is used in
technical documents, and how that understanding is sometimes expressed
in formatting. I recommend it if you want to see the creative and deep
thought that goes into producing a really useful DTD. (I believe
that there is a copy of the architecture document at the ifi.uio.no
FTP site; I'm sure someone will correct me if I'm wrong. :-)
Marcy
--
Marcy Thompson
work: marcy@passage.com
play: marcy@squirrel.com mlt@netcom.com
From: davido@apocalypse.org (David Ornstein)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: Slight flame on SGML
Date: Mon, 27 Nov 1995 00:34:18 GMT
Organization: InterNex Information Services 1-800-595-3333
Message-ID: <49b12d$ctm@voyager.Internex.NET>
References: <47d5ro$o3j@hamilton.maths.tcd.ie> \ <47o0tp$fjb@icarus.convex.com> <19951108T121248Z@naggum.no> <47tn8b$m7r@news.iastate.edu> <487tql$mus@dove.nist.gov> <48bv00$79@doc.jmu.edu> \ <48q6u9$5d3@dove.nist.gov> \ <48t627$fr6@voyager.Internex.NET> <48vu69$dt6@rap.SanDiegoCA.ATTGIS.COM> \
pcg@aber.ac.uk (Piercarlo Grandi) wrote:
>>>> On 22 Nov 1995 19:37:45 GMT, swf@nehwselsegundoca.attgis.com (Stan
>>>> Friesen) said:
>Stan> In article <48t627$fr6@voyager.Internex.NET>,
>Stan> davido@apocalypse.org (David Ornstein) writes:
>David> At the risk of stepping out having not read all the important
>David> documents in detail, let me ask about style sheets and markup. It has
>David> seemed to me that the CLASSes being added to the various tags (e.g.,
>David> \) are really a poor-man's way to add new tags (e.g.,
>David> \). Is this basically right?
>In practice yes: in theory it is a monstrosity. CLASSes ought to be used
>exclusively to differentiate between different flavours of the same
>thing, not to extend stealthily the range of HTML elements. Even if some
>HTML3 papers have examples like '\', it is an
>abomination.
OK. I see that my intuition was basically on target. I sense that
this 'abomination' is a coming train.
>The main problem is that the values that the 'CLASS' attribute can take
>are not part of the HTML standard at all, therefore they are arbitrary,
>and therefore they should be only used to specify rendering
>possibilities, not structure (for structure _must_ be
>document-independent).
A very slippery slope. This is such a fine distinction that I think
we're going to have a lot of trouble. Just look at the level of
knowledge (WRT HTML) of the average reader of this group (ciwah).
People are going to start using CLASS for things that really should be
semantic (e.g., Appendix). For more evidence, see the discussion in a
different thread about character markup vs heading tags...
>Stan> Yes - with the caveat that they *also* provide a way to specify a
>Stan> reasonable default behavior.
>Let me add: a reasonable _HTML_ default behaviour. Because one of the
>important consideration in the HTML3 design has been backwards
>compatibility with HTML2 implementations.
We could (?) try the same trick that Netscape uses for frames. You
have a head section that contains the content for browsers capable of
handling mappings from, say, \
to \. Generally, that
trick is a coin that we could probably spend once to make a big leap
in capability in HTML and still preserve reasonable behavior in
existing browsers. I'm not sure about this...
>This has obvious benefits in terms of continuity, as compared to "OK,
>HTML was small and fun, good to start with, but now there is no problem
>in switching to full SGML-capable browsers".
>The obvious snag is that one is then encouraged to do things like:
>Stan> That is using, or instance, \ says that this
>Stan> is an appendix title, *and* that it can be safely considered to be
>Stan> equivalent to a second level header without doing damage to the
>Stan> document content.
>Which, as argued above, strictly speaking, are horrors.
Agreed. But this may just be a fact of life as HTML can only evolve
-- I doubt revolution (say full SGML) is likely to work broadly...
>Stan> Saying \ says this is an appendix, and who the heck *knows*
>Stan> how to treat it if it cannot be treated as an appendix title.
>Which would be the right thing to do, _if_ \ were part of
>HTML's content model.
Right.
>Stan> This difference - the mechanism for attaching default meaning - is
>Stan> the critical one that makes it a useful and *safe* means of
>Stan> extending HTML.
>Again, the point is that _really_ 'CLASS' should not imply any new
>"meaning", just customization of an existing thingie.
>Stan> The lack of this is why \ was such a bad idea. What could
>Stan> this man to a browser that doesn't have this extension? On the
>Stan> other hand using \ at least says to treat the text
>Stan> as a paragraph even if centering is not supported.
>This works because 'ALIGN' has no semantic connotation. '\
' is a
>paragraph however it is aligned. But consider instead
> \
> \
> \
See above.
>If this usage spreads (which in some measure is inevitable), then HTML
>could be replaced by HTML(-1), with a single
> \
>construct. :-)
I used to point out to people that they could easily collapse all
their object models down to: Object-related-to-Object. Voila! You've
covered everything.
It's useless, of course...
---------------------------------------------
David Ornstein
BrowserCaps: http://objarts.com/bc
Outbreak: http://ichiban.objarts.com/outbreak-unreg
Personal Info: http://objarts.com/davido
From: Bruce Hunter \
Newsgroups: comp.text.sgml
Subject: PRODUCT ANNOUNCEMENT - SGML processing language with free tools
Date: 27 Nov 1995 02:18:51 GMT
Organization: Direct Connection (Call 0181 297 2200 for info)
Message-ID: <49b76b$a4v@newsgate.dircon.co.uk>
SGMLC - AN SGML DOCUMENT PROCESSING LANGUAGE, WITH FREE COMPILER AND
RUNTIME PROCESSOR
SGMLC is an SGML document processing language which is based extensively upon the C programming language. It is the language at the heart of a
suite of products, developed by SGML Systems Engineering, which enable
the creation of powerful and flexible SGML document processing systems.
These may range from simple transformation tools, for example to convert
an SGML document into another form, to complex IETMS's and batch page
composition systems.
Programs, or "rules files", are written in SGMLC using any text editor.
They are then compiled into object form by the SGMLC compiler. There are
several versions of the SGMLC compiler, ranging from a basic free one to
the complete SGMLC Application Development Environment, which
incorporates an interactive debugger, C language DLL interface, and
system customisation facilities.
The resulting object file may then be freely distibuted to anyone who
has the free SGMLC runtime processor. An SGML document is processed by simply opening it in the runtime processor and applying the relevant
rules file.
The free version of the compiler, together with the free runtime
processor, will be all that is needed to create stand-alone SGML
document transformation applications. It is hoped that the free
availability of these tools will remove the need for hybrid, multi-pass
applications such as, for example, using perl to process the output of
the sgmls parser.
For commercial organisations who require a supported product, but who do
not need the full Application Development Environment, there will be an
enhanced, supported version of the free compiler. The additional
facilities will include enhanced regular expression patterm matching
together with the ability to attach the inbuilt SGML parser to any
stream rather than just the input stream. This will enable, for example,
complex up-translations where the parser is used to parse the generated
output. The output parser data structures are then available for making
conditional decisions whilst processing the input.
A range of SGMLC add-on libraries will also be available. These include
the SGMLC-View library, which provides the functionality for creating
complex SGML browser applications, including support for complex tables
and equations. The SGMLC-CALSTAB library provides pre-written viewer
support for processing CALS tables. It conforms to the recommendations
of the SGML Open White Paper on Tables Interoperability.
The range of products, which will be made generally available after the
SGML 95 conference for the MS Windows operating systems, may be seen at
the SGML Systems Engineering booth in the exhibition area at SGML 95. A
limited number of diskettes, containing the free products and examples
of some sophisticated viewer applications, will be available there.
After SGML 95 the free SGMLC compiler and runtime processor will be made
publicly available via the usual SGML archive ftp sites.
regards,
Bruce Hunter
SGML Systems Engineering
bruce@sgml.dircon.co.uk
From: jbw@bigbird.bu.edu (Joe Wells)
Newsgroups: comp.text.sgml
Subject: Re: Content Model Blues
Date: 27 Nov 95 02:32:40 GMT
Organization: Boston University Computer Science Department
Message-ID: \
References: <4981lf$qcj@agate.berkeley.edu> \
In-reply-to: jbw@bigbird.bu.edu's message of 26 Nov 95 19:47:24 GMT
In article \ jbw@bigbird.bu.edu (Joe Wells) writes:
The HTML 2.0 DTD botched the handling of this issue in a different way.
The following is *valid* by the HTML 2.0 DTD:
\
\
\
Here is
\
the title.
\
\
\
\ Here is the body. \
\
\
I would like to correct myself. My statement was based on the August 4,
1995 version of the HTML 2.0 DTD. In that version, the TITLE element was
declared as follows:
\
It occurred to me that this problem might have been fixed in the final
version in RFC 1866. I just checked and indeed the problem had been
fixed. The declaration of the TITLE element now reads like this:
\
What is a bit confusing to me is why a declared content of RCDATA was not
used instead of a content model with an exclusion exception. Is there
some disadvantage to a declared content of RCDATA? (Yes, I know about the
problems with CDATA, I am asking about *R*CDATA.)
--
Joe Wells \
From: kshapiro@cyberus.ca (Kivi Shapiro)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: hdp://
Date: 26 Nov 1995 22:55:37 -0500
Organization: Graduate School of Library and Information Science, UWO
Message-ID: <49bcrp$qq4@cyberhome.cyberus.ca>
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <48vf7f$nuc@dove.nist.gov> \
Reply-To: kshapiro@pobox.com (Kivi Shapiro)
Summary: The time is ripe.
In article \,
Piercarlo Grandi \ wrote:
>
> \
>
>(hdp of course is the "hardware download protocol") seems rather hard to
>do.
But it fills a need! Can you imagine the usefulness of this protocol?
I'll help write the RFC, if you like. The Web will never be the same.
- Kivi, who thinks the rec.arts.startrek.tech folks might be able to help
--
kshapiro@julian.uwo.ca or kivi@pobox.com (Kivi Shapiro)
All original material in this message is in the public domain.
Newsgroups: comp.text.sgml,alt.fan.bill-gates,alt.fan.mozilla,alt.fan.newt-gingrich,alt.politics.usa.republican
From: poulosio@netcom.com (Mad Max)
Subject: Re: Slight flame on SGML
Message-ID: \
Organization: Troll Busters Ltd. Melbourne VIC.
References: <9511171331.AA12385@fly.HiWAAY.net> <48qmfi$14r@bell.maths.tcd.ie>
Date: Mon, 27 Nov 1995 04:32:47 GMT
Sender: poulosio@netcom16.netcom.com
Timothy Murphy (tim@maths.tcd.ie) wrote:
: * 25% of World Wide Web users have incomes of more than $80,000.
If that is anywhere true, they all hang out in alt.politics.* newsgroups,
which could be rmgrouped and replaced with
alt.flame.the-poor.get.off.your.lazy.arses.and.get.a.bloody.job .
Of course it would be named in American-made English instead, which is
the M$-DOG of human language, of course.
--
"When I collect two solar masses of AOL Disks, I will use them to detonate
the Sun" Web Page URL: ftp://ftp.netcom.com/pub/po/poulosio/poulosio.html
Mad Max, LT, TRES Corps, Kappa. Shut the immigration inlet valve NOW!
Linux: You get what you pay for.
From: pepper@falch.no (Steve Pepper)
Newsgroups: comp.text.sgml
Subject: Re: Numeric Tags and Parsing Errors
Date: Mon, 27 Nov 1995 12:56:11 LOCAL
Organization: Falch Infotek AS, Norway
Message-ID: \
References: \ \
In article \ sjd@ebt.com
(Steven J. DeRose) writes:
|In article \,
|tim@michelob.wustl.edu (Tim Rand) wrote:
|
|> We have recently run across a problem were codes that begin with numerics
|> are passed through parsing, yet are invalid.
|>...
|> Example: <3a> passed through and it isn't defined in the DTD.
|
| [...]
|
|In the SGML declaration you thoughtfully supplied the usual naming rules
|are in effect, so "3" is a name character but not a name *start* character
|(that is, names like GIs can't start with it).
|
|So in "<3a>" the "<" is not a STAGO in context, and is just data. Assuming
|#PCDATA is allowed at the points in question, that won't produce an error.
|
| [...]
|
|If you want <3a> as a tag, you'd need to add digits to the list of name
|start characters, and then declare the element type as usual.
Unfortunately this is one of the few areas in which the SGML syntax isn't
configurable. [189] naming rules states that "A _character_ assigned to
LCNMCHAR, UCNMCHAR, LCNMSTRT, or UCNMSTRT cannot be an LC Letter, UC Letter,
Digit, RE, RS, SPACE, or SEPCHAR."
In other words: You can't just add digits to the list of name start
characters. There is no way of turning <3a> into a valid tag.
Steve
Steve Pepper, SGML Architect pepper@falch.no
Falch Infotek a.s, Postboks 130 Kalbakken, N-0902 Oslo, Norway
http://www.falch.no/ tel://+47 2290 2733 fax://+47 2290 2599
"The Whirlwind Guide": http://www.falch.no/~pepper/sgmltool/
From: tonyk@cybercom.net (Antoun Kanawati)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: hdp://
Date: Mon, 27 Nov 1995 07:59:02 -0500
Organization: at home
Message-ID: \
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <48vf7f$nuc@dove.nist.gov> \ <49bcrp$qq4@cyberhome.cyberus.ca>
In article <49bcrp$qq4@cyberhome.cyberus.ca>, kshapiro@pobox.com (Kivi
Shapiro) wrote:
> > \
> >
> >(hdp of course is the "hardware download protocol") seems rather hard to
> >do.
>
> But it fills a need! Can you imagine the usefulness of this protocol?
> I'll help write the RFC, if you like. The Web will never be the same.
Sounds like trouble. PC's have this tendency to behave rather strangely
if you don't exactly the right mixture of versions (OS, Windows, DLL's,
etc...) Things are bad enough with some installers overwriting DLLs
indiscriminately. Getting what appears to be device drivers loaded thru
WWW sounds like a major security hole, and a nice way to crash many delicately
configured platforms.
--
Antoun (Tony) Kanawati
tonyk@cybercom.net
http://www.cybercom.net/~tonyk/
From: liora@delphi.com
Newsgroups: comp.text.sgml
Subject: Re: SGML book?
Date: Mon, 27 Nov 95 08:42:53 -0500
Organization: Delphi (info@delphi.com email, 800-695-4005 voice)
Message-ID: \
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ <1995Nov20.045420.29244@sq.com> <1995Nov20.214942.10037@sq.com> \ \
tore@lis.pitt.edu (Tore Joergensen) wrote:
>kaikow@standards.com wrote:
>
>: Is the Alschuler book better than the Practical SGML
book by Van
>: Herwijejnen? If so, how?
>
>: In article \, Peter Flynn
\ wrote:
>: >Dixit pyachnes@ucs.indiana.edu (paul alan yachnes):
>: >> Can someone recommend the best book for learning
SGML?
>: >The best one I can recommend is "ABCD... SGML: A User's
Guide
>: >to Structured Information" by Liora Alschuler (ISBN: 1-
850-32197-3)
>
>"ABCD...SGML" is not a book for learning SGML. The
explanation of the
>SGML syntax is very brief. This book tries to explain the
big picture,
>rather than to learn you how to write SGML.
>
> "Practical SGML" by Eric van Herwijnen is a book for
learning SGML,...
Quite true, if by "learning SGML" Mr. Joergensen means
learning SGML syntax. There is little overlap in function
between my book and van Herwijnen's or the other books on
SGML syntax mentionned in the same post.
An earlier post also distinquished between the two types of
book saying that those who are learning to write SGML syntax
needn't explore the choices for the "big picture" type of
book. Also a statement that I must agree with -- assuming
that these syntax writers have the big picture already in
mind, garnered through other sources, before they sit down
to write code.
I have one other clarification on the position of my book in
terms of the others mentionned in this thread and that is
that unlike at least one of the two recent books that
attempt to put SGML into a broad perspective, my book does
not deal with the Web as if it should just go away. Two of
the book's case studies are of Web sites, one in HTML, one
in SGML, and the book comes bundled with Web-ready Panorama
files and a custom build of SoftQuad's Panorama Web viewer
for SGML... and much, much more, as they say.
Finally, I want to correct the Web addressses I posted
earlier. At the current time, the HTML excerpts from ABCD...
are available on:
http://www.thomson.com/itcp/liora/LA_HOME.HTM
NOTE: No final "L" (hope to have this fixed soon)
The SGML excerpts will be posted to
http://www.thomson.com/itcp.html under WebExtras.
NOTE: No intermediate /itcp/ directory.
Sorry if this caused any inconvenience.
Regards,
Liora
Liora Alschuler
The Word Electric
POB 177
East Thetford, VT 05043
802/785-2623
liora@delphi.com
Newsgroups: comp.text.sgml
From: jbottoms@world.std.com (John W Bottoms)
Subject: Content Markup & Blackbird
Message-ID: \
Organization: The World Public Access UNIX, Brookline, MA
Date: Mon, 27 Nov 1995 16:37:36 GMT
There are too many areas to keep up with. I was reading in "Wired"
that Blackbird will support content markup with Microsoft Markup
Language (MSML?) which is based on SGML. Has any information been
published on how MS will determine the content structure for the
various communities of interest?
Also, BillG has a book out called "The Road Ahead" which may may
reference to these future trends.
-jb
Newsgroups: comp.text.sgml
From: mstewart@sq.com (Mike Stewart)
Subject: SGML, The New World Tour CD-ROM
Message-ID: <1995Nov27.205343.12374@sq.com>
Sender: news@sq.com (News Administrator)
Organization: SoftQuad Inc,
Date: Mon, 27 Nov 1995 20:53:43 GMT
About a year ago SoftQuad Inc. released a CD-ROM (SGML, The World Tour)
full of stuff about SGML. The purpose of the CD was to be a reference for
and to promote the use of SGML. We're doing it again and this time we're
looking for your input. Our updated version, which will be titled,
"SGML, The New World Tour" will be available for free. We are,
therefore, limited to royalty-free content. We are targetting the content
towards three levels of experience: those who have never encountered SGML,
those who have some familiarity with it, and those who are already involved
with SGML but wish to expand their knowledge.
We are organizing materials under the following headings: "Starting Out in
SGML"; "Tutorials and Demos"; "Who's Using SGML (The Web, Industry,
Experts and Accessibility)"; "Resources and References" (including history
and F.A.Q.s); and "SoftQuad, Inc.".
Contributors (if we use your stuff) are entitled to free copies of the CD
when it comes out. We are looking for marked up contributions (we'll be
using SoftQuad's Panorama to browse the material). We're also open to
suggestions and pointers to neat stuff (like SGML freeware and cool
projects).
Email us at worldtour@sq.com and we can work out the details of how to
submit and when. Oh yeah, by the way we're looking for final submissions
by December the 15th. I know, I know, it's close. If this were an ideal
world we'd have got on it sooner (but then again publishing dates wouldn't
move up either).
P.S. The old World Tour CD is available, contact us for details.
====================
Mike Stewart SGML, The New World Tour
SoftQuad Inc. (416) 239-4801 xt 203
mstewart@sq.com
From: lazio@spacenet.tn.cornell.edu (T. Joseph W. Lazio)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: hdp://
Date: 27 Nov 1995 21:18:01 GMT
Organization: Department of Astronomy, Cornell University
Sender: jl26@cornell.edu (Verified)
Message-ID: \
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <48vf7f$nuc@dove.nist.gov> \ <49bcrp$qq4@cyberhome.cyberus.ca> \
Reply-To: lazio@spacenet.tn.cornell.edu
In-reply-to: tonyk@cybercom.net's message of Mon, 27 Nov 1995 07:59:02 -0500
>>>>> "AK" == Antoun Kanawati \ writes:
In article <49bcrp$qq4@cyberhome.cyberus.ca>, kshapiro@pobox.com (Kivi
Shapiro) wrote:
> > \
> >
> >(hdp of course is the "hardware download protocol") seems rather hard to
> >do.
>
> But it fills a need! Can you imagine the usefulness of this protocol?
> I'll help write the RFC, if you like. The Web will never be the same.
AK> Sounds like trouble. PC's have this tendency to behave rather
AK> strangely if you don't exactly the right mixture of versions (OS,
AK> Windows, DLL's, etc...) Things are bad enough with some
AK> installers overwriting DLLs indiscriminately. Getting what
AK> appears to be device drivers loaded thru WWW sounds like a major
AK> security hole, and a nice way to crash many delicately configured
AK> platforms.
Ah, but I'm sure that with 50000 bit secure encrypting combined with
the appropriate salts, transfer of the drivers can be accomplished.
Seems to me the more difficult issue will be whether the bandwidth is
sufficient to handle the the necessary encoding to transfer the actual
hardware.
--
Cornell knows I exist?!? | e-mail: lazio@spacenet.tn.cornell.edu
Sgt. Lazio, HTML police | http://astrosun.tn.cornell.edu/students/lazio/
STOP RAPE | ICBM: 42:20:08 N 76:28:48 W 305 m alt.
sci.astro FAQ at http://astrosun.tn.cornell.edu/students/lazio/sci.astro.html
Newsgroups: comp.text.sgml
Message-ID: <9511280033.AA0192@notes.microstar.com>
From: Ken Holman \
Date: 27 Nov 1995 16:31:42 GMT
References: \
Subject: Re: What SGML products are using OLE?
[Lori Bolen]
> I was wondering which SGML products are using OLE? Are there any?
The NEAR & FAR Author for Microsoft Word product (now shipping V1.0) will
allow external entities (referenced either as general entities in content
or the entity attribute of an element) to be linked (not embedded)
OLE-servicable objects.
\
\
\
\
\
\
]>
\
\
This is a general entity reference to an OLE object: \&sheet;
\
This is an element reference to an OLE object:\
\
The object is displayed within the Word environment; double-clicking on the
entity construct, or selecting the menu item Edit/LinkedObject on the
element construct, will invoke the appropriate OLE server.
Configuration files allow any SYSTEM identifier to be mapped to OLE servers
through WIN.INI field values, and the catalog file maps PUBLIC identifiers
to SYSTEM identifiers.
........... Ken
G. Ken Holman Tel: +1 613 596-CADE(2233) /\\ /\\
Chief Technology Officer Fax: +1 613 596-5934 \\/ \\/ Computer
Microstar Software Ltd. WATS: 1 800 267-9975 /\\ /\\ Aided
3775 Richmond Road Mail: gkholman@microstar.com \\/ \\/ Document
Nepean Ontario Info: cade@microstar.com /\\ /\\ Engineering
CANADA K2H-5B7 Web: http://www.microstar.com \\/ \\/
*** Please Note New Address ***
From: bmoore@neonramp.com (Bill Moore)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: hdp://
Date: Mon, 27 Nov 1995 17:19:30 +0200
Organization: Digital Revolution
Message-ID: \
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <48vf7f$nuc@dove.nist.gov> \ <49bcrp$qq4@cyberhome.cyberus.ca>
In article <49bcrp$qq4@cyberhome.cyberus.ca>, kshapiro@pobox.com (Kivi
Shapiro) wrote:
> In article \,
> Piercarlo Grandi \ wrote:
> >
> > \
> >
> >(hdp of course is the "hardware download protocol") seems rather hard to
> >do.
>
> But it fills a need! Can you imagine the usefulness of this protocol?
> I'll help write the RFC, if you like. The Web will never be the same.
>
> - Kivi, who thinks the rec.arts.startrek.tech folks might be able to help
> --
> kshapiro@julian.uwo.ca or kivi@pobox.com (Kivi Shapiro)
> All original material in this message is in the public domain.
Sorry folks, but I missed something. hdp hardware download protocol ?
Anyone care to explain? ( I gotta read this group more often!)
Bill
--
Bill Moore Internet Publishing & Design
Editor, Sr. Partner
Digital Revolution Publishers of MacWare Revue
399 N 117th,Suite 301 http://www.piazza.com
Omaha, NE 68154 MacO/S Freeware/Shareware Reviews
\MacWare Revue\
From: jsbell@acs.ucalgary.ca (Joshua Sean Bell)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: hdp://
Date: 28 Nov 1995 02:28:18 GMT
Organization: University Computing Services (The University of Calgary)
Message-ID: <49ds42$ri8@ds2.acs.ucalgary.ca>
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <49bcrp$qq4@cyberhome.cyberus.ca> \ \
T. Joseph W. Lazio \ wrote:
>>>>>> "AK" == Antoun Kanawati \ writes:
>
>In article <49bcrp$qq4@cyberhome.cyberus.ca>, kshapiro@pobox.com (Kivi
>Shapiro) wrote:
>> > \
>> >
>> >(hdp of course is the "hardware download protocol") seems rather hard to
>> >do.
>>
>> But it fills a need! Can you imagine the usefulness of this protocol?
>> I'll help write the RFC, if you like. The Web will never be the same.
>
>AK> Sounds like trouble.
>
> Ah, but I'm sure that with 50000 bit secure encrypting combined with
>the appropriate salts, transfer of the drivers can be accomplished.
>Seems to me the more difficult issue will be whether the bandwidth is
>sufficient to handle the the necessary encoding to transfer the actual
>hardware.
We'll have to state in the RFC that browsers are *required* to fetch
hdp: method URLs, and ignore any attempts at content negotiation. I
mean, imagine the poor user who will have to sit through the microseconds
necessary for this:
Accept: Macintosh/PowerPC-NuBus; q=0.9, Macintosh/PowerPC-ISA; q=0.1
Content-type: Wintel/PentiumPro-ISA
Accept: Macintosh/PowerPC-NuBus; q=0.9, Macintosh/PowerPC-ISA; q=0.1
Content-type: I SAID *Wintel/PentiumPro-ISA*
Accept: Macintosh/PowerPC-NuBus; q=0.9, Macintosh/PowerPC-ISA; q=0.1
Content-type: Fuck you, Macintosh sucks!
Accept: Macintosh/PowerPC-NuBus; q=0.9, Macintosh/PowerPC-ISA; q=0.1
Content-override: munition/x-hydrogen-bomb; v=10.1MT
Obviously, the Macintosh should recognize its obselecense and gracefully
commit sepuku (sp?) before the Mightyness That Is Bill.
Joshua
--
___.----~~~----.___ | MIME: jsbell@acs.ucalgary.ca
,--------.-.,-'-------------------` | WWW: http://www.ucalgary.ca/~jsbell/
`--------"-'-.,---`~~~-----~~~'
'---'-._____/ --===( Note: the above is humor. Hu-mor. Look it up.)
From: K. McLennan <71431.2255@CompuServe.COM>
Newsgroups: comp.text.sgml
Subject: ISO12038 Math viewer advice needed
Date: 28 Nov 1995 00:20:56 GMT
Organization: CompuServe, Inc. (1-800-689-0736)
Message-ID: <49dkl8$nc0$2@mhadg.production.compuserve.com>
I'm looking for a PC utility capable of displaying WYSIWYG, or near WYSIWYG, representations of ISO12038 Mathematics. Any advice would be greatly appreciated.
Thanks in advance,
- Karl
From: islander@msen.com (Michael Bauser)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: Slight flame on SGML
Date: 28 Nov 1995 02:32:04 -0500
Organization: Vagreangvbany Fbpvrgl sbe gur Cerfreingvba bs ebg13
Message-ID: \
References: <47d5ro$o3j@hamilton.maths.tcd.ie> \ <47o0tp$fjb@icarus.convex.com> <19951108T121248Z@naggum.no> <47tn8b$m7r@news.iastate.edu> <487tql$mus@dove.nist.gov> <48bv00$79@doc.jmu.edu> \ <48q6u9$5d3@dove.nist.gov> \ <48t627$fr6@voyager.Internex.NET>
Reply-To: islander@msen.com (Michael Bauser)
-----BEGIN PGP SIGNED MESSAGE-----
Following-up from \
In article <48t627$fr6@voyager.Internex.NET>,
davido@apocalypse.org (David Ornstein) wrote:
> smishra@gaia.cc.gatech.edu (Sunil Mishra) wrote:
> At the risk of stepping out having not read all the important
> documents in detail, let me ask about style sheets and markup. It has
> seemed to me that the CLASSes being added to the various tags (e.g.,
> \) are really a poor-man's way to add new tags (e.g.,
> \). Is this basically right?
>
> Ideally (of course?) browsers would support displaying custom tags
> defined in DTDs that are specific to the documents being viewed. I
> know they don't do this (I assume because it's a lot of work and they
> have too many other things on thier minds),
Poppycock. Balderdash. Foolishness. They don't do this because they know
that most people couldn't/wouldn't learn enough SGML to write their own
DTD. For God's Sake, Marc "Excuse me while I take over the Web" Andreesen
can't create SGMLable extensions -- what makes you think all his followers
could?
> but why didn't the style
> sheet folks design something that was a bit more in line (like
> allowing the style sheet information to define the new tags for a
> browser using stuff in the header -- thus avoiding the full complexity
> of DTDs)?
The Cascading Style Sheet proposal *does* have a limited ability to
define new tags, although it's not the primary thrust of the proposal.
(I wouldn't really recommend using it, myself -- suddenly defining
new tags isn't very backward compatible, after all.)
Besides, DTDs are about syntax, not representation. I can create a
DTD that tells a parser where \
From: HARTMUT BECKER \
Newsgroups: comp.text.sgml
Subject: Re: SGML -> HTML
Date: 28 Nov 1995 16:47:19 GMT
Organization: University of Rostock
Message-ID: <49feen$18i@taiwan.informatik.uni-rostock.de>
References: <48g54e$8d3@news.INbe.net>
To: carrasco@innet.lu
"M.T. Carrasco Benitez" \ wrote:
>I am looking for SGML->HTML filter, or a pointer to the FAQ where this
>kind of answer could be. Mail prefered.
Try SGMLS.pm ("a perl5 class library for handling output from the SGMLS
and NSGMLS parsers"); URL:
http://www.uottawa.ca/~dmeggins/SGMLSpm/sgmlspm.html
(You have to use SGMLS/SGMLS parser for parsing your SGML documents; URL for
NSGMLS:
http://www.jclark.com/sp.html)
With SGMLS.pm there is a special script (sgmlspl) coming which is taking a
so-called specification file to handle all the elements of your document. For
each start tag and end tag there is an event where you can specify what to do
when the tag is occuring.
You can even produce a specification file template with sgmlspl. You just have
to fill the events with Perl5 code.
I have been using SGMLS.pm/sgmlspl for one week. It seems to be quite a good
tool. At the same time I started learning Perl. (What an awful language; why is
there no well-designed C++ class library available for this?!)
Hartmut Becker (hbecker@informatik.uni-rostock.de)
From: mwm@contessa.phone.net (Mike Meyer)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: hdp:// & sdp:// (Was: Slight flame on SGML)
Date: Tue, 28 Nov 95 09:32:37 PST
Organization: Missionaria Phonibalonica
Message-ID: <19951128.75B52E8.8D50@contessa.phone.net>
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <48vf7f$nuc@dove.nist.gov> \
In \, pcg@aber.ac.uk (Piercarlo Grandi) wrote:
> What do you do in this case?
> \
> (hdp of course is the "hardware download protocol") seems rather hard to
> do.
Ah, a solution to my problem with "Download NetScape Now"):
\
Download \NetScape Now,\
and if you're foolish enough not to own a PC, get \one of those\ with your
choise of OS: \Windows
NT\, \, Windows
95\ or \Unix.\
\
From: Martin Morrison \
Newsgroups: comp.text.sgml
Subject: Re: SGML, The New World Tour CD-ROM
Date: 28 Nov 1995 18:25:17 GMT
Organization: Shared Medical Systems
Message-ID: <49fk6d$epe@bdt.bdt.com>
References: <1995Nov27.205343.12374@sq.com>
To: pmarks@allegra.com
--
Martin Morrison
Sr. Technical Writer/Documentation Coordinator
Shared Medical Systems - Turnkey Systems Division
From: jenglish@crl.com (Joe English)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml
Subject: Re: Slight flame on SGML
Followup-To: comp.text.sgml
Date: 28 Nov 1995 10:06:35 -0800
Organization: Mentally ill venomous purists with strange emotional drives
Message-ID: <49fj3b$5ef@crl2.crl.com>
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <48q6u9$5d3@dove.nist.gov> \ <48t627$fr6@voyager.Internex.NET>
David Ornstein \ wrote:
>[...] It has
>seemed to me that the CLASSes being added to the various tags (e.g.,
>\) are really a poor-man's way to add new tags (e.g.,
>\). Is this basically right?
Exactly.
The "usual" way to accomplish this in SGML is to define a "meta-DTD"
or "architecture" for a particular application (e.g., online
delivery), and then map elements in the source DTD ("CHAPTER",
"SECTION") to architectural forms in the meta-DTD ("DIV"). This can
be done with a link process definition or by adding #FIXED attributes
to the source DTD:
\
...
]>
This way the element types in the source DTD reflect what's
important semantically ("this is a chapter"), and an attribute
tells a specific application (e.g., a Web browser) how to process
them ("this is a hierarchical division").
HTML 3 with the CLASS attribute is an SGML architecture turned
inside out. With this approach, the _element types_ tell the
application how to process each element, and an _attribute_
specifies the semantically important information: \
This is a nice compromise that doesn't require too much SGML
sophistication on the part of authors or Web browsers, but it
allows at least some semantic information to be retained when
converting other SGML documents into HTML.
>Ideally (of course?) browsers would support displaying custom tags
>defined in DTDs that are specific to the documents being viewed. I
>know they don't do this (I assume because it's a lot of work and they
>have too many other things on thier minds), but why didn't the style
>sheet folks design something that was a bit more in line (like
>allowing the style sheet information to define the new tags for a
>browser using stuff in the header -- thus avoiding the full complexity
>of DTDs)?
With SGML, it's not possible for a stylesheet to define new element
types ("tags"); that can only be done in the DTD.
At least one earlier version of HTML 3/HTML+ (circa late 1993)
provided a mechanism whereby authors could define new element types.
If memory serves, it looked something like:
\
\
\
\
\
\
]>
\
\...
\
\
You can use \person>, \place>,
and \thing> tags in this document.
...
\\
This scheme was eventually rejected.
I expect that HTML 4 (if and when that happens) will be defined
as an SGML architecture instead of an SGML DTD.
--Joe English
jenglish@crl.com
From: mchepeka@microstar.com ()
Newsgroups: comp.text.sgml
Subject: Microstar announces release of NEAR & FAR Author
Date: Tue, 28 Nov 1995 21:14:21 GMT
Organization: HookUp Communication Corporation, Oakville, Ontario, CANADA
Message-ID: <49fu4r$cup@noc.tor.hookup.net>
FOR RELEASE IMMEDIATELY
MICROSTAR ANNOUNCES RELEASE OF NEAR & FAR(r) AUTHOR, STRUCTURED
AUTHORING ENVIRONMENT FOR MICROSOFT(r) WORD 6.0
OTTAWA, CANADA, November 27, 1995 -- Microstar Software Ltd. today
announced the release of NEAR & FAR Author, its structured, graphical
authoring product for Word.
Designed as a Word add-on, NEAR & FAR Author enables writers to create
structured documents based on the international standard, SGML
(Standard Generalized Markup Language) right in their familiar Word
environment with no knowledge of SGML required.
"No other software today lets authors create and edit structured
documents quickly and easily with no knowledge of SGML required -- all
inside their Microsoft Word environment," said Shaun McEwan,
Microstar's President.
NEAR & FAR Author uses a graphical document model to act as the
document's structure guide. Work can be stored as a Word document or
easily saved as an SGML file. Writers can import and export SGML
documents and format or print the documents using Word's extensive
feature set.
NEAR & FAR Author's key user-driven features include:
*Quick startup
*Graphical tree map of document model
*Intuitive interface
*On-line author's guide
*Continuous validation
*Requires no knowledge of SGML syntax
*Interactive DTD view
*WYSIWYG on-screen display
Peter Jordan, Microstar's Chairman and CEO, explained that SGML
documents are "intelligent documents." "Intelligent documents save
businesses time and money. Information can be standardized and reused,
at the same time that issues like document conversion are eliminated.
With NEAR & FAR Author for Word, corporations are taking the first
step towards enterprise-wide document management."
"Not many writers know SGML, but almost everyone knows Microsoft
Word. That gives us a bigger pool of resources to draw from and we
can eliminate the time spent training writers how to produce
SGML-compliant documents," said Brian Fitzpatrick, of US-based
Information Handling Services, an information access solutions
provider.
Jordan also pointed out, "With the introduction of NEAR & FAR Author,
businesses that want structured documents do not need to invest
heavily in tools and retraining. They also gain the added advantage
of a wider variety of publishing destinations including CD-ROM and the
World Wide Web."
* Microstar's introductory pricing for NEAR & FAR Author is $249 (US)
per copy, 30% off the regular retail cost until January 31, 1996.
-30-
Microstar Software Ltd. is a publicly traded software and services
company listed on The Toronto Stock Exchange (MSS).
NEAR & FAR and C.A.D.E. are registered trademarks of Microstar
Software Ltd. Microsoft is a registered trademark of Microsoft
Corporation.
For more information please contact:
Mandy Chepeka
Marketing Communications Specialist
Microstar Software Ltd.
3775 Richmond Road
Nepean, Ontario
Canada
K2H 5B7
Ph: 613-596-2233
Fx: 613-596-5934
Internet: mchepeka@microstar.com
OR
Cathy Browne
Account Director
Hill & Knowlton
55 Metcalfe St., Suite 1300
Ottawa, Ontario
Canada
K1P6L5
Ph: 613-786-9951
Fx: 613-238-8642
Internet: cbrowne@ott.hillknowlton.ca
From: Michael Hilton <100016.3262@CompuServe.COM>
Newsgroups: comp.text.sgml
Subject: How many elements do you have?
Date: 28 Nov 1995 21:54:02 GMT
Organization: -
Message-ID: <49g0dq$309$1@mhafc.production.compuserve.com>
The DTD I use has over 100 elements & about 20 PIs. Is that a lot?
It certainly makes it harder writing conversions to other things.
-Michael
--
Michael Hilton
From: John Lamp \
Newsgroups: comp.text.sgml
Subject: Re: Is there a thing such as http://www.sgmlopen.home?
Date: 29 Nov 1995 01:33:01 GMT
Organization: University of Tasmania, Department of Computer Science
Message-ID: <49gd8d$lbj@franklin.cc.utas.edu.au>
References: \
kaikow@standards.com wrote:
>
>Or something like that?
Yes, have a look at the links on http://lamp.cs.utas.edu.au/net.html
Cheers
John
--
_--_|\\ John Lamp, originating in Hobart, Tasmania
/ \\ Phone: 002 20 2375 - Fax: 002 20 2913
\\_.--._/ email: John.Lamp@cs.utas.edu.au
v <--<< http://lamp.cs.utas.edu.au/jw_lamp.html
Information Systems Group, Dept of Comp Science, Uni of Tasmania
Newsgroups: comp.text.sgml
From: jjr@panini.att.com (Jeff Rocca)
Subject: SGML_PATH (sgmls vs. nsgmls)
Message-ID: \
Sender: news@nntpa.cb.att.com (Netnews Administration)
Organization: AT\&T Bell Laboratories
Date: Wed, 29 Nov 1995 00:55:00 GMT
I have been using the SGML_PATH environment variable with sgmls to specify
where the entity manager should look for files. For example, if my DTD file
contains the following lines
\
%foo;
and I
export SGML_PATH=/usr/home/entities/%N
prior to running sgmls, sgmls correctly finds the file named "foo" in the
directory named /usr/home/entities.
I have now migrated to nsgmls and discovered that the SGML_PATH environment
variable is no longer available. How can I achieve the same results with
nsgmls?
Thanks,
Jeff Rocca
jjr@pez.att.com
From: j_mcarthur@BIX.com (Jeffrey McArthur)
Newsgroups: comp.text.sgml
Subject: Re: Program to take SGML source and produce a DTD
Date: 29 Nov 1995 00:59:50 GMT
Organization: \
Message-ID: <49gba6$qrv@news2.delphi.com>
References: \
Marcy Thompson (Netcom Online Communications Services (408-241-9760 login:
guest)) wrote:
>Do not be deceived into believing that document analysis is the act of
>"recording what is there". It is the act of expanding your vision to
>see what is there and what is not and then creatively designing an
>architecture that supports both your notions of what is there and
>your ideas of what is not there. Designing a DTD is a profoundly
>creative act, and if you approach it as the task of "tagging what
>I see", you will get the wrong DTD, I promise.
My opinion is simple: read Deming. Dig out a copy of "Sample Design in
Business Research" ISBN 0-471-52370-4. Think about applying Deming's
standard design philosophies to DTD development.
----
Jeffrey M\\kern-.05em\\raise.5ex\\hbox{\\b c}\\kern-.05emArthur
a.k.a. Jeffrey McArthur email: j_mcarthur@bix.com
home: (410) 290-6935
The opinions expressed are mine. They do not reflect the opinions
of my employer. My access to the Internet is NOT paid for by my
employer. My access to the Internet is on my own time at my own
expense.
Newsgroups: comp.text.sgml
Date: Wed, 29 Nov 1995 10:48:33 +0100
From: Stéphane Bornnerand \
Message-ID: <199511290948.KAA22985@cygne.ais.berger-levrault.fr>
Subject: SGML'95 - AIS
AIS will be at the GCA SGML '95 Conference and Expo, Sheraton Boston Hotel
and Towers, Boston, December 4-7.
AIS will present the new release of the Balise(TM) product.
Balise is an SGML document processing language with usual features to
handle text and specialized features to handle SGML structures under
various representations such as events and trees.
The main characteristics of Balise could be listed as follows:
· Compiled/interpreted language
· Automatic memory management
· Portable (source and binary level)
· Modular and Extensible Architecture (5 set of APIs)
· C-like syntax with object-oriented declaration
· General purpose with SGML knowledge buit-in at the core
· Mixed SGML document processing mode (event-driven, Tree navigation)
· SGML Context handling based on both backward and look-ahead
· Source level debugger
The Balise V3 supports the new features:
New embedded parser
This new Balise release is based on James Clark's new SP parser, which
supports all SGML features (except CONCUR) and allows for any SGML
declaration.
Support for double-byte characters
Balise is now based on Unicode for internal character representation and
fully supports double-byte encoding systems, including for text search and
regular expression matching functions. JIS, Shift-JIS and EUC are accepted
as input/output coding systems.
Built-in ESIS-tree manipulation primitives
The direct tree-manipulation primitives available through the Polypus
library in previous versions are now part of the Balise kernel, allowing
for improved performance, reduced memory consumption, and seamless
integration with standard Balise programming.
Built-in source-level debugger
Balise now provides a full-blown source-level debugger as part of the
Balise program itself. Large programs can now be debugged easily.
New DLL based architecture
On most platforms (Unix, Windows, Windows NT), Balise is now implemented as
a set of dynamic libraries, allowing easy substitution of some components
(such as the SGML parser) and easy integration with other software.
Balise SDK
The Balise Software Developer's Kit draws upon the DLL architecture to
provide programmers with a full set of APIs to extend Balise and/or to use
it as an embedded engine.
New supported platforms
Balise V3 is available on: DOS (32 bits), Windows 95, Windows NT,
SUN/Sun-OS, SUN/Solaris, HP9xxx/HP-UX, RS6000, SGI, DEC alpha/OSF, Solaris
X86, Unixware.
In Europe, AIS will also partcipate to Documation '95 Conference and
Expo, Hotel Meridien Etoile, Paris, France, December 12 and 13.
AIS will demonstrate DynaPage(TM) product which is based on
Balise. DynaPage is a Page Composition Gateway which fills in the gap
between SGML and page makeup systems such as MS Word, FrameMaker, and
Interleaf.
For further information, contact Isabelle Bornier:
phone: [+33] 1 46 40 84 15
FAX: [+33] 1 46 40 84 10
email: ibor@ais.berger-levrault.fr
Regards,
Stephane Bornerand.
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
_/ Stephane Bornerand phone: [+33] 1 46 40 84 06 _/
_/ AIS S.A. FAX: [+33] 1 46 40 84 10 _/
_/ 35, Rue du Pont email: sbor@ais.berger-levrault.fr _/
_/ 92200, Neuilly, FRANCE WWW: http://www.ais.berger-levrault.fr _/
_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
From: LSBUMGAR@vax1.acs.jmu.edu (lee s. bumgarner)
Newsgroups: comp.text.sgml,comp.infosystems.www.authoring.html,alt.culture.www
Subject: Re: Content Markup & Blackbird
Date: 29 Nov 1995 03:02:33 GMT
Organization: Undertoad Hypermedia Publishing Co.
Sender: LSBUMGAR@jmu.edu
Message-ID: <49gig9$qjb@doc.jmu.edu>
References: \
In-Reply-To: jbottoms@world.std.com's message of Mon, 27 Nov 1995 16:37:36 GMT
In \ jbottoms@world.std.com writes:
> There are too many areas to keep up with. I was reading in "Wired"
> that Blackbird will support content markup with Microsoft Markup
> Language (MSML?) which is based on SGML. Has any information been
> published on how MS will determine the content structure for the
> various communities of interest?
>
> Also, BillG has a book out called "The Road Ahead" which may may
> reference to these future trends.
The mere mentioning of Blackbird makes my bloodboil...Here are my questions:
1)Seriously, how likely is it that Blackbird will take over the Web and we
will have to wait for Blackbird98 to update the tech?
2)What is Hytime vis-a-vis Hyper-G...
3)How likely is it that the current http://www.xxxx.com setup, etc will
be changed to such an extent that the average user will have to learn a new
URL setup? (I personally find it somewhat unlikely that the current setup
is going to change dramaticlly.
please email response....
-l
---
-----> Undertoad (under construction) http://falcon.jmu.edu/~bumgarls/ <--------
REALITY.SYS corrupted. Reboot universe (Y/N/Q)? | "Usenet is Frosty The Snowman
committing suicide with a flame thrower."-- Kibo |Somebody visited a.r.k and all
I got was this lousy .sig | M$'s Blackbird is EVIL, do not support it! | ##30##
From: edwardj@interramp.com (Edward Jung)
Newsgroups: comp.text.sgml
Subject: Re: Content Markup & Blackbird
Date: Wed, 29 Nov 1995 08:43:12 GMT
Organization: PSI Public Usenet Link
Message-ID: <49h6hb$agp@usenet1.interramp.com>
References: \
jbottoms@world.std.com (John W Bottoms) wrote:
>There are too many areas to keep up with. I was reading in "Wired"
>that Blackbird will support content markup with Microsoft Markup
>Language (MSML?) which is based on SGML. Has any information been
>published on how MS will determine the content structure for the
>various communities of interest?
That markup language is defined by a specific SGML DTD. The DTD is
published to allow content providers to write translators. As I
understand it, this DTD was defined by the Blackbird designers (and
reviewed by some of their content partners).
===== My thoughts are my own =====
Edward Jung, edwardj@interramp.com
Newsgroups: comp.text.sgml
From: ak@oasis.icl.co.uk (Alfie Kirkpatrick)
Subject: Re: How many elements do you have?
Message-ID: \
Sender: news@oasis.icl.co.uk
Reply-To: ak@oasis.icl.co.uk
Organization: ICL, Bracknell, UK
References: <49g0dq$309$1@mhafc.production.compuserve.com>
Date: Wed, 29 Nov 1995 12:06:59 GMT
Michael Hilton (100016.3262@CompuServe.COM) wrote:
: The DTD I use has over 100 elements & about 20 PIs. Is that a lot?
: It certainly makes it harder writing conversions to other things.
: -Michael
100 elements doesn't sound too many. The real question is whether
your typical document uses them all. Most conversions out of a DTD
don't cater for every element/element context. I tend to write
the conversion scripts for our document set and a couple of other
docs if I can find them. Then write in clear warnings when an element
is passed over so that if an author uses a previously uncatered
for element, they can come and discuss it with you. You can then
decide whether to cater for it or tell them not to use it!
I do have the luxury of being able to discuss these issues
with authors (I also do some authoring myself) and am able to change
the processing whenever required.
I've never heard of processing instructions being specified in
the DTD (seems like a failing of the DTD design to me), but
maybe that's just my limited experience...
Hope this helps. Alfie.
--
+-Professional Publishing Services---------------+
| Alfie Kirkpatrick ICL |
| external: +44 1344 472500 Lovelace Road |
| internal: 7263 2500 Bracknell, Berks |
| mail: ak@oasis.icl.co.uk RG12 8SN |
+------------------------------------------------+
From: tfb@ed.ac.uk (Tim Bradshaw)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: hdp://
Date: 29 Nov 1995 12:33:47 GMT
Organization: AIAI, University of Edinburgh
Message-ID: \
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <48vf7f$nuc@dove.nist.gov> \ <49bcrp$qq4@cyberhome.cyberus.ca> \
In-reply-to: bmoore@neonramp.com's message of Mon, 27 Nov 1995 17:19:30 +0200
* Bill Moore wrote:
> Sorry folks, but I missed something. hdp hardware download protocol ?
It's an attempt to update and standardise the facilities provided by
the well-known UUMT system (unix-unix matter transfer) provided with
BSD 4.2 and later releases.
--tim
From: Bob Steele \
Newsgroups: comp.text.sgml,ba.jobs.offered
Subject: Wanted: Software Engineers: SGML, OODBMS, DMS
Date: Wed, 29 Nov 1995 11:40:35 -0800
Organization: Technology Deployment International
Message-ID: <30BCB733.1EE2@tdiinc.com>
CC: bsteele
Wanted: Software Engineers: SGML, OODBMS, DMS
Technology Deployment International (TDI) is looking to expand its
expertise in object-oriented databases to SGML and Document Management
Systems. We have a number of openings for engineers with backgrounds in
document management and object database internals. We see that as
corporations move from paper based document systems to electronic
repositories, the need for sophisticated storage and retrieval will be
great. Due to the complex structure of SGML documents, we believe
OODBMS's are well suited to this task. We also believe the WWW will play
a major role in providing the delivery of document services. This is an
opportunity to influence this industry.
TDI is a growing software consulting house with a great deal of expertise
in object-oriented databases. Many of us have strong academic
credentials in addition to our professional careers.
Our web site, http://www.tdiinc.com will give an indication of our
corporation and its direction.
Please contact:
Bob Steele, Vice President of Multimedia Information Systems
Technology Deployment International
4100 Moorpark Ave, Suite 116
San Jose, CA, 95117
--
Bob Steele, TDI Inc., 4100 Moorpark Ave, Suit 116, San Jose, CA, 95117
(408) 345 9157
mailto:bsteele@tdiinc.com http://www.tdiinc.com
From: kathy_graham@corp.dialog.com (Kathy Graham)
Newsgroups: comp.text.sgml
Subject: Re: CALS DTD table attributes
Date: Wed, 29 Nov 1995 21:28:28 GMT
Organization: Knight-Ridder Information, Inc.
Message-ID: <49iii9$aj5@iserv.dnt.dialog.com>
References: <49532v$it1@kannews.ca.newbridge.com>
Paul Madsen \ wrote:
>I'd like to get some info on the CALS table (and its subelements) attributes.
>Is there an archive defining the purpose of each of the attributes somewhere?
>Thnaks for any help
>Paul Madsen
>Online Documentation
>Newbridge Networks Corp
Another option is to get the complete CALS specification. You can get
to it electronically via the Navy WEB page at:
navysgml.dt.navy.mil
It is a long specification, but all of the gory details are there.
Hope this helps...
Kathy Graham
From: jenglish@crl.com (Joe English)
Newsgroups: comp.text.sgml
Subject: Re: How many elements do you have?
Date: 29 Nov 1995 10:01:21 -0800
Organization: Mentally ill venomous purists with strange emotional drives
Message-ID: <49i75h$opm@crl12.crl.com>
References: <49g0dq$309$1@mhafc.production.compuserve.com>
Michael Hilton <100016.3262@CompuServe.COM> wrote:
>The DTD I use has over 100 elements & about 20 PIs. Is that a lot?
>It certainly makes it harder writing conversions to other things.
It may help to use #FIXED attributes in the DTD. For example,
one of my current projects -- a series of MIL-STD-2167a
documents -- has very strict preparation requirements regarding
the order of paragraphs and subparagraphs. The approach I've
taken is to declare an element type for each section, and use
the SGML parser to validate that everything is present and in
the right order.
However, the processing is in most cases identical for
every section-like element. For the benefit of post-processors,
each section has a #FIXED attribute 'FORM=SECTION':
\
\
\
\
\
[ ... ]
\
\
This way, the conversion scripts only need to check the
FORM attribute value instead of dozens of different
element types.
--Joe English
jenglish@crl.com
Newsgroups: comp.text.sgml
Date: 29 Nov 1995 14:17:05 -0500
From: "Eve L. Maler" \
Message-ID: <199511291910.OAA21341@village.doctools.com>
References: <48ihmj$bge@newsbf02.news.aol.com> <48nd2s$25j@news2.delphi.com> <1995Nov20.051628.29522@sq.com>
Subject: Re: Program to take SGML source and produce a DTD
lee@sq.com (Liam R. E. Quin) wrote:
| SGML DTDs can serve two disparate functions. The first, and most
| common, is a set of prescriptive rules, saying what is allowed where.
| The second is to define a set of descriptive markup in which any
| restrictions on content models are merely for convenience in later
| processing, since the authoring is already done.
:
| If you *do* have editorial control, though, it's not so good, as it
| turns out that any errors you make in your tagging will cause the DTD
| (that is later generated automatically) to allow those errors...
|
| In this sort of case, another approach is incremental. I sometimes
| define a simple DTD, use Author/Editor to enter (or edit) some text, or
| perhaps even vi.... and then type markup into Author/Editor. I then
| change the DTD to allow the new markup, save and re-import the file
| (`open' uses the old rules file from the in-memory cache) and proceed
| in a loop until tired.
:
| Still, two approaches to DTD-writing, and two kinds of DTD, points at
| the ends of spectra, perhaps defining a rectangular co-ordinate
| space...
I can respect that people have found lots of ways to do DTD development,
but... I've spent a lot of time working on and thinking about exactly this
topic, and every time I've been involved in a project where there was a
rush to "make the DTD," by whatever means, it's been a mess. Usually it's
because (sorry, I'm probably about to offend somebody) the hacker type in
charge of the DTD was too enthusiastic about coding to stop and listen to
the people who will need to *use* the DTD.
I can see where it could be useful to spot-tag a document for content
analysis and then back-form a DTD for it, but only if: you already know
what is to be considered important, you're a DTD expert with experience at
balancing all the inherent tradeoffs (and a parser in your head :-), and
the DTD has a single consumer: you. Otherwise you'll be sorry when it
comes time to deploy the DTD in any real way: no specification, no
rationale management, no documentation, and often no "portability"
(suitability for general use) to SGML environments other than the one for
which you developed it originally.
There can be many motivations for tagging particular pieces of information,
and many ways to do it. It's not a reductionist task of "identifying the
data so it doesn't get lost." There's an art to defining what (in your
particular case) counts as "data" (being sensitive to your goals and tools
for creation, management, processing, etc.); naming the element types and
attributes; choosing methods for identifying the data (multiple elements?
same element in multiple contexts? attribute value?); and so on.
Implementing a chosen DTD design is easy, whether it's in vi (my usual
weapon of choice) or Near\&Far; *doing* the design is not a trivial task.
Regards,
Eve
--
\
\
\
\
From: jsbell@acs.ucalgary.ca (Joshua Sean Bell)
Newsgroups: comp.infosystems.www.authoring.html,comp.text.sgml,alt.fan.mozilla
Subject: Re: hdp://
Date: 30 Nov 1995 00:57:19 GMT
Organization: University Computing Services (The University of Calgary)
Message-ID: <49ivhf$100s@ds2.acs.ucalgary.ca>
References: <47d5ro$o3j@hamilton.maths.tcd.ie> <49bcrp$qq4@cyberhome.cyberus.ca> \ \
Tim Bradshaw \ wrote:
>* Bill Moore wrote:
>> Sorry folks, but I missed something. hdp hardware download protocol ?
>
>It's an attempt to update and standardise the facilities provided by
>the well-known UUMT system (unix-unix matter transfer) provided with
>BSD 4.2 and later releases.
While we waste bandwidth (hey, Bill Gates says it'll be unlimited soon;
I guess anyone who pays to download this can send the... invoice... to
him. :)
XEROX ANNOUNCES HYPER-ETHERNET
SAN FRANCISCO, CA., Jan 7, 2010 -- Xerox today announced
Hyper-Ethernet, its fourth-generation local area network. In addition to
its ability to transmit text, data and images, Hyper-Ethernet enables the
transmission of people. "People transmission over Hyper-Ethernet," according
to Michael Liddle, V.P. of Office Systems, "will greatly reduce elevator
congestion and eliminate the need for video conferencing." Order taking for
Hyper-Ethernet will begin next month. Installation will start in Los
Angeles in the Third Quarter.
In a related announcement, Wang Labs, headquartered in Hoboken, New Jersey,
announced Super-Hyper Wangnet, its twelfth generation local area network.
According to Freddie Wang, President of Wang Labs, "Super-Hyper Wangnet will
not only transmit people over the Wangnet, but will also transmit furniture
and buildings over the interconnect and utility bands. These additional
capabilities of Super-Hyper Wangnet are vital to the emerging office of the
future." Order taking for Super-Hyper Wangnet will begin next month.
Installation has already occurred worldwide.
IBM Corporation, which has been rumored to be about to announce a local
area network since 1980, was not available for comment.
Digital Responds to Hyper-Ethernet
TEWKSBURY, MA, April 1, 2010 -- Digital Equipment announced today its
new DECNet Phase XVIII Architecture. In response to recent Xerox and
Wang improvements to Ethernet that provide people- and facility-
transportation across inter-node links, DEC's latest DECNet provides
these capabilities as well as providing for the creation of virtual
facilities and even countries. These capabilities are provided by
breakthroughs in communications technology that actually uses the
Ether as a communications medium. Through the use of a new dedicated
NANO-PDP-11/E99 gateway processor system, ETHERGATE, DECNet users can
access anywhere in the Ethereal Plane.
This development obsoletes teleconferencing, since meeting groups can
create their own common conference rooms and cafeterias, thus resolving
space, travel, and dining problems. There may be a few bugs left, as
some of the dissenting DECNet Review Group members have not been seen
since the last meeting held in such a virtual conference facility.
This breakthrough was brought about by a team of the Distributed Systems
Software and Hardware engineering teams in an effort to improve on their
Tewksbury, Massachusetts, facility. In a compromise decision, Distributed
Systems will maintain an ETHERGATE in TWOOO but it will connect directly
to their new home somewhere in the Shire of their newly defined Middle
Earth reality. Despite some difficulties, the scenery, windows, tax
breaks, pool, and racquetball courts made the relocation go quite smoothly.
Engineering Network topology will not change, as all forwarding will be
done by the TWOOO Ethereal Plane Router residing in the crater at the
former building site.
Utility packages such as Ethereal Person Transfer (EPT) and Ethereal
Facility Transfer (EFT) provide appropriate capabilities for casual
users. Sophisticated users can create ($CREATE), access ($OPEN) and
delete ($NUKE) ethereal entities transparently from high level
languages using the Ethereal Management System (EMS) package and the
Ethereal Access Protocol (EAP). An ETHERTRIEVE utility for easy
interactive use will be available shortly.
DECNet Phase XVIII follows on the success of the Phase XVI ability to
access everyone's Digital Professional Wristwatch computer system.
The lead to the current Phase XVII architecture, which has routing
capabilities that allow direct communications with the entire Earth
population's Atari home video games.
Distributed Systems architects are hard at work on the next phase of
DECNet that will include multi-plane existence network management
(using the NIECE protocol) and galaxy level routing using 64K-bit
addresses.
Digital will continue to support its Gateway products into the Prime
Material Plane. These products include an IBM ANA (Acronym-based
Network Architecture) Gateway, the TOLKIEN product that allows control
of all ring based networks, and our Mega-broad-jump-band hardware
Sorry, I don't have an attribution for the above (I'd love one,
though). It's at least 5 years old, one of the first things I came
across and saved when I first ventured onto the Internet.
The fact that the competitor is Digital not Novell certainly dates it,
but it remains funny. :)
Joshua
--
___.----~~~----.___ | MIME: jsbell@acs.ucalgary.ca
,--------.-.,-'-------------------` | WWW: http://www.ucalgary.ca/~jsbell/
`--------"-'-.,---`~~~-----~~~'
'---'-._____/
From: jamesr@magna.com.au (James Robertson)
Newsgroups: comp.text.sgml
Subject: Re: Content Markup & Blackbird
Date: 30 Nov 1995 00:38:01 GMT
Organization: Desktop Law
Message-ID: <49iuda$7k9@kettle.magna.com.au>
References: \ <49h6hb$agp@usenet1.interramp.com>
Reply-To: jamesr@desklaw.com.au
In article <49h6hb$agp@usenet1.interramp.com>, edwardj@interramp.com says...
>
>jbottoms@world.std.com (John W Bottoms) wrote:
>
>>There are too many areas to keep up with. I was reading in "Wired"
>>that Blackbird will support content markup with Microsoft Markup
>>Language (MSML?) which is based on SGML. Has any information been
>>published on how MS will determine the content structure for the
>>various communities of interest?
>
>That markup language is defined by a specific SGML DTD. The DTD is
>published to allow content providers to write translators. As I
>understand it, this DTD was defined by the Blackbird designers (and
>reviewed by some of their content partners).
>
>
Has anyone actually seen the DTD? Is it on the Microsoft Web site? If so,
where?
Any word on how compatible it is with HTML 2.0? Does it contain a lot
of horrific Word-isms, or have they done proper job?
Yours curiously,
J
--
----------------------------------------------------------------------------
James Robertson * jamesr@desklaw.com.au
Coding * Design * Layout * Newton * Windows
"Beyond the idea"
From: hayashia@dps.ntts.co.jp (Akihiro Hayashi)
Newsgroups: comp.text.sgml
Subject: elisp for mule
Date: 30 Nov 1995 03:47:48 GMT
Organization: NTT Software Corp., Yokohama, Japan
Message-ID: <49j9h4$cmc@nttsgw.yh.ntts.co.jp>
Hi, Netters!
Do you know some elisp which helps me make sgml, DTD files,
like cmutex.el or html-helper-mode.el... ?
--------
Akihiro Hayashi \
From: Dathan@delta.pp.fi
Newsgroups: comp.text.sgml
Subject: HTML and SGML Authoring
Date: 30 Nov 1995 11:59:22 GMT
Organization: DELTA COMPUTERS LTD OY
Message-ID: <49k6aq$3kn@idefix.eunet.fi>
Reply-To: Dathan@delta.pp.fi
Summary: FREE DEMOS
Keywords: HTML,SGML
We are pleased to announce the use of our WWW home page for FREE
fully featured working demonstrations of the SGML TagWizard and WebWizard
products for authoring HTML and SGML.
URL:http://www.jsp.fi/delta
From: bkline@cortex.nlm.nih.gov (Bob Kline)
Newsgroups: comp.text.sgml
Subject: Q: what's a caron?
Date: 30 Nov 1995 12:22:01 GMT
Organization: National Library of Medicine
Message-ID: <49k7l9$lj3@lhc.nlm.nih.gov>
I've been scrounging around trying to track down the meaning of the word
"caron" which I came across in one of the ISO documents on character sets
(ISO 8879:1986) while looking for the official SGML element names for the
uppercase and lowercase "Polish L" characters (as the USMARC standard
refers to them). The ISO document doesn't refer to anything as a "Polish
L" but does have two possible candidates (that is, two each for the
lowercase and uppercase versions of Polish L), referred to as "L caron"
and "L stroke." The "L stroke" seems possible because the characters in
question do have a short stroke through the middle of the vertical, from
lower left to upper right (and in fact, the older USMARC documents refer
to the characters as "L slash" which seems close enough to "L stroke").
But I can't rule out "L caron" as a possibility because I can't determine
what kind of mark "caron" refers to. Unfortunately, the ISO documents
themselves (or at least the versions to which I have access) don't
provide the graphic represntation of the characters, and I don't have any
really specialized reference sources to turn to. Things like the Oxford
English Dictionary (both the original and new editions) or Webster's
International Unabridged (both 2nd and 3rd editions) don't have
anything. I even tried a Polish dictionary, but since I don't know
anything about Polish, I'm afraid I wasn't able to make much intelligent
use of that source.
Can anyone tell me what a "caron" is? Email response if possible,
please. Thanks!
Bob Kline
bkline@cortex.nlm.nih.gov
bob_kline@stream.com
From: kmoran@riatax.com (Kevin J. Moran)
Newsgroups: comp.text.sgml
Subject: Re: SGML book?
Date: Thu, 30 Nov 1995 16:59:42 GMT
Organization: Research Institute of America, New York, NY
Message-ID: <49ko8g$4cj@news.tpp.com>
References: <480i5v$hsp@usenet.ucs.indiana.edu> \ <1995Nov20.045420.29244@sq.com> <1995Nov20.214942.10037@sq.com> \ \
Reply-To: kmoran@riatax.com
liora@delphi.com wrote:
\
>Excerpts from ABCD... SGML are available at:
>
>http://www.thomson.com/itcp/liora/LA_HOME.HTML
>
Try http://www.thomson.com/itcp/liora/LA_HOME.HTM instead.
From: jenglish@crl.com (Joe English)
Newsgroups: comp.text.sgml
Subject: Re: Program to take SGML source and produce a DTD
Date: 30 Nov 1995 11:14:52 -0800
Organization: Mentally ill venomous purists with strange emotional drives
Message-ID: <49kvrc$rva@crl2.crl.com>
References: <48ihmj$bge@newsbf02.news.aol.com> <48nd2s$25j@news2.delphi.com> <1995Nov20.051628.29522@sq.com> <199511291910.OAA21341@village.doctools.com>
Eve L. Maler \ wrote:
>[...] every time I've been involved in a project where there was a
>rush to "make the DTD," by whatever means, it's been a mess. Usually it's
>because (sorry, I'm probably about to offend somebody) the hacker type in
>charge of the DTD was too enthusiastic about coding to stop and listen to
>the people who will need to *use* the DTD.
>
>I can see where it could be useful to spot-tag a document for content
>analysis and then back-form a DTD for it, but only if: you already know
>what is to be considered important, you're a DTD expert with experience at
>balancing all the inherent tradeoffs (and a parser in your head :-), and
>the DTD has a single consumer: you. Otherwise you'll be sorry when it
>comes time to deploy the DTD in any real way: no specification, no
>rationale management, no documentation, and often no "portability"
>(suitability for general use) to SGML environments other than the one for
>which you developed it originally.
Hear, hear! I can attest to this personally. (And wish I had
heard your advice when I first learned the mechanics of SGML...)
Of course one should not underestimate the usefulness of
hacking together DTDs for personal use; there's no better
way to gain the requisite experience. Every DTD I write is
for my use alone; I figure after another year's practice or
so I'll be ready to do it for real :-)
--Joe English
jenglish@crl.com
From: jeffrey@hks.com (Glenda Jeffrey)
Newsgroups: comp.text.sgml
Subject: Marked sections inside entities?
Date: 30 Nov 1995 18:25:23 GMT
Organization: Hibbitt, Karlsson & Sorensen, Inc.
Message-ID: <49ksuj$8qd@argo.hks.com>
I hope someone can suggest a way to do what I'm trying to
do here. I need to set up a bunch of entity definitions
(hundreds, actually) that look something like this:
\The Title\,
Section \\" >
I'd like for these definitions to be compiled into my DTD. (I'm
using ArborText Adept.) Adept doesn't seem to like this situation
unless I define %booka; and %bookb; before defining myxref.
I tried adding these definitions to my DTD:
\
\
but then, of course, Adept just replaces %booka; and %bookb; with
"INCLUDE" -- not much use.
I've read about the MS keyword, as in:
\
but this setup means that my entire entity definition has to be
a single marked section -- not what I'm after.
Does anyone have any clever ways of doing what I'm trying to do
here?
A little background, in case it's not already clear what I'm trying
to do: I have two books which are produced from the same SGML source.
I do the cross referencing within the books using an autogenerated
list of entities, like %myxref; above. (I could autogenerate them on the fly,
but this would mean I'd have to print the entire book every time --
impractical at best.) I'd like to use marked sections %booka; and %bookb;
to select which cross reference will get picked up when I print.
Thanks in advance for any suggestions!
--
Glenda Jeffrey Email: jeffrey@hks.com
Hibbitt, Karlsson & Sorensen, Inc Phone: 401-727-4200
1080 Main St. Fax: 401-727-4208
Pawtucket, RI 02860
From: amgreene@ATHENA.MIT.EDU (Andrew Marc Greene)
Newsgroups: comp.text.sgml
Subject: Re: Content Markup & Blackbird
Date: 30 Nov 1995 22:55:15 GMT
Organization: Massachusetts Institute of Technology
Message-ID: <49lcoj$365@senator-bedfellow.MIT.EDU>
References: \
In article \ jbottoms@world.std.com (John W Bottoms) writes:
>There are too many areas to keep up with. I was reading in "Wired"
>that Blackbird will support content markup with Microsoft Markup
>Language (MSML?) which is based on SGML. Has any information been
>published on how MS will determine the content structure for the
>various communities of interest?
Blackbird Markup Language (BBML or BML, the folks at Microsoft can't
seem to make up their minds :-) is basically HTML minus forms minus
nested lists plus OLE embedding.
To claim that Blackbird is SGML is as inaccurate as to claim that
"the Web" is SGML. Both Blackbird and "Web browsers" use SGML
document types; Blackbird actually enforces its DTD (as opposed to
Netscrape, for instance); but neither Blackbird nor any of the
mainstream HTML-based browsers are capable of handling any DTD
except the one hardocded into their calcified little arteries.
- Andrew Greene
[Disclaimer: I co-authored the original Blackbird Word import and
export filters about six months ago. For that I humbly apologize
to the SGML community and to the world at large.]
--
Andrew M. Greene \ http://www.mit.edu:8001/people/amgreene
Newsgroups: comp.text.sgml
From: robinw@andyne.on.ca (Robin Wynne-Edwards)
Subject: SGML 95 - Document Databases
Message-ID: \
Sender: -Not-Authenticated-[4949]
Xdisclaimer: No attempt was made to authenticate the sender's name.
Organization: Andyne Computing Ltd.
Date: Thu, 30 Nov 1995 20:17:22 GMT
Andyne Computing Limited will be sponsoring two focus groups on The
Future of Structured Document Databases during SGML 95 in Boston. The
focus groups are part of an Andyne research initiative, code named
Rosetta, that will be launching an object oriented document database in
the third quarter of 1996.
The Agenda is:
Introduction
What Should a Document Database Do (and Not Do)
Non-SGML File Support Issues
Document Query Wish List
The Ideal SGML Product Suite
The Focus Groups will be held in
The Clarendon Room,
Third Floor,
Sheraton Boston
on Tuesday December 5 at 3.00pm and again at 4.00pm.
SGML Users and VARs are invited to attend.
If you are interested in attending please look for the sign up sheets
on bulletin boards at the conference or respond by eMail. Space is
limited.
Newsgroups: comp.text.sgml
From: pbg (Paul Grosso)
Subject: Re: Is there a thing such as http://www.sgmlopen.home?
Message-ID: <1995Nov30.152635.15058@arbortext.com>
Sender: news@arbortext.com
Organization: ArborText, Inc. 1000 Victors Way Ann Arbor, Michigan 48108
Date: Thu, 30 Nov 1995 15:26:35 GMT
> From news Wed Nov 29 18:45:45 1995
> Newsgroups: comp.text.sgml
> From: kaikow@standards.com
> Subject: Is there a thing such as http://www.sgmlopen.home?
> Date: Wed, 22 Nov 1995 22:28:18 GMT
>
> Or something like that?
I'm not sure why this is dated Nov 22, but someone on the 29th just
gave some unexpected response, and I haven't seen the right response,
so excuse me if this is a repeat.
It's http://www.sgmlopen.org
paul
Paul Grosso
Chief Technology Officer, SGML Open
paul@arbortext.com
===================================
From: Francis Cave \
Newsgroups: comp.text.sgml
Subject: Re: Program to take SGML source and produce a DTD
Date: Thu, 30 Nov 95 23:22:54 GMT
Organization: Pira International
Message-ID: <817773774snz@pira2.demon.co.uk>
References: <48b8rc$qpd@manuel.anu.edu.au>
Reply-To: cave@pira2.demon.co.uk
There's a clever gadget called the DREAM Parser that might
interest you, but I'm not sure how available it is, if at all. It was
developed at GMD-IPSI in Darmstadt and, as I understand it (which, as a
non-user, is necessarily only to a limited extent), is a tool for converting
to SGML with or without a pre-existing DTD. The rules that one writes to
perform the conversion are declaration-like and can be used to generate a DTD
"post hoc", or so I believe. I suppose this implies that the person writing
the conversion must have the DTD in their head to start with. Being lazy, I
find the idea of a tool that would enable me to write a DTD _and_ the
conversion program at one go very appealing.
I too like the statistical approach to DTD derivation from tagged text.
When a colleague of mine was recently faced with working out a DTD suitable
for coding 30 years-worth of journal articles - having decided that none of
the public DTDs was quite right and with no clear picture as to what coding
would be likely to prove important in the future - it was decided that the
best policy was to write a DTD that describes what is there and leave the
worrying about what else might be needed until the need arises.
--
Francis Cave