Last Modified: 1/29/08
  Computer Training
Introduction to XML

HTML Is Not Enough

HTML is the most successful programming language ever created, so why move on to something else?

Problems with HTML

  • For Presentation Only
    HTML is a presentation language intended to specify how a page will appear when displayed in a browser. HTML tags allow you to identify text blocks as headings, paragraphs, list items, etc., but not by other logical types such as price, dates, or location

    Even in terms of presenting information HTML is very limited, in that much of its content is in an aggregate form which cannot be analyzed, searched, manipulated, or edited. For example, consider the presentation of mathematical equations with integrals, matrices, and the like. In HTML this is usually presented as a GIF or PostScript file, and while it looks pretty, you cannot *do* anything with it -- you can't locate common sub-expressions, substitute one expression for another, or feed the GIF file into a symbolic mathematical analyzer.

    Or consider a 3D graphic image: in HTML this is either presented as a static picture, or as a link to a special purpose program that allows only those manipulations explicitly enabled by the programmer.

  • Flexible Syntax
    HTML syntax is not strict, making it difficult to develop dynamic page designs controlled by programming languages such as JavaScript. In addition, its loose syntax invites varying interpretation by different browsers.

  • No New Tags
    You cannot add new tags to HTML. The set of tags available is fixed and cannot be extended.

  • Dumb Searches
    Because HTML is not "aware" of what a text block is, other than how it is to be presented, searches of HTML documents usually must be string searches, thus, the term "Penguin" could refer to a bird, a hockey team, or a publishing firm. Searching for text strings alone creates huge numbers of false matches.

  • High Bandwidth Needs
    Since HTML cannot effectively process, edit, or manipulate information, except through special purpose scripts, interactions with data bases on servers require extensive back-and-forth communication between client and server.

  • Limited International Capabilities
    Lack of full and consistent Unicode support limits the ability of HTML to support different character sets for the worlds languages.

Previous Home Next

Topics

Summary

HTML Is Not Enough

What Is XML?
  Ontologies
  SGML, HTML, & XML

XML Basics
  HTML Example
  XML File
  Structure
  Paths
  Well-Formed
  DTDs
  Schemas
  Validation
  Unicode
  What It Means

Transforming For
Presentation

  DHTML
  CSS
  XSL

Serving And Processing XML
  Server Side
  Client Side

XML Applications   Information Reuse
  B2B
  Text Encoding
  Syndication

Security

XML Resources On The Web

Part Two Of Class

 
Previous Home Next

Other Topics:   XML Editors

©1999 UW Technology