4DOMStandards-Based XML and HTML manipulation using PythonFourthought, Inc.http://4Suite.org2000-11-014Suite 0.9.2 released.2000-10-114Suite 0.9.1 released.2000-09-204Suite 0.9.0 released. 4DOM is now bundled therein2000-07-244DOM 0.10.2 released2000-06-064DOM 0.10.1 released2000-05-244DOM 0.10.0 released2000-03-164DOM 0.9.3 released2000-01-254DOM 0.9.2 released2000-01-034DOM 0.9.1 released1999-12-194DOM 0.9.0 released1999-10-214DOM 0.8.2 released1999-09-144DOM 0.8.1 released1999-08-314DOM 0.8.0 released1999-02-074DOM 0.7.0 released1998-11-204DOM 0.6.1 released1998-11-044DOM 0.6.0 released
4DOM is a Python implementation of the document object model (DOM), a standard interface for manipulating XML and HTML documents developed by the World-Wide Web Consortium. 4DOM implements DOM Level 2 Core, Level 2 HTML and Document Traversal, and a few extensions.
4DOM is designed to allow developers rapidly design applications that read, write or manipulate HTML and XML.
The current version is bundled with 4Suite . See the ChangeLog for notes on the current version.
Download 4Suite install one of the binary packages. Alternatively, you can download the source instead and install using Python distutils as follows:
python setup.py install
See PACKAGES for more information about the available 4Suite packages. Be sure to check the README in the 4Suite package for more details.
xml/domCore DOM components (including XML classes)xml/dom/htmlHTML componentsxml/dom/extExtensions and proprietary componentsxml/dom/docs(currently minimal) documentationDOCUMENTATION_PATH/4Suite/4DOM/demoSmall scripts demonstrating some uses of 4DOM. See the README in this directory.DOCUMENTATION_PATH/4Suite/4DOM/test_suiteTest scripts.Accessors/Mutators for Attributes
Following discussion on the Python XML SIG mailing list, 4DOM provides two ways to access DOM interface attributes. As an example, the DOM IDL definition for the Node interface contains readonly attribute DOMString childNodes. This can be accessed as a simple Python attribute: node.childNodes, or as a method call using the Python/CORBA mapping for attributes: node._get_childNodes() [if childNodes were a read/write attribute, there would also be a node._set_childNodes()]. There is a slight speed advantage to using the latter convention.
Document._get_ownerDocument()Document._get_ownerDocument() returns a pointer to itself.Creating HTML Element Nodes
HTMLDocument.createElement() overrides the Document.CreateElement() method, looking up the specified tag and returning an instance of the propriate HTML node. For instance:
# html_doc is an instance of HTMLDocument
table_elem = html_doc.createElement("TABLE")
# table_elem is an instance of HTMLTableElement
4DOM does not implement DOMString. Instead, the interfaces use a plain Python string instead. Note that Python strings do not have length limitations, and unicode is still in beta.
The DOM Spec section on the removeAttribute method of the Element interface has some rules for Attribute removal with respect to default values. 4DOM only follows these rules if you remove attributes using the removeAttribute method, and the default attribute will not be properly set if you use removeNamedItem to remove an attribute from the NamedNodeMap returned by Element.getAttributes.
The DOM ambiguously specifies that if the given name in the removeNamedItem method of NamedNodeMap not found, None is returned and an exception is raised. This isn't possible in most languages. 4DOM chooses to return None.
4DOM does not implement HTMLElement features strictly for browser environment, for example, blur and focus properties of HTMLSelectElement.
Some methods of the DOM spec for HTML do not allow for errors associated with missing nodes. So, for example, HTMLDocument::setTitle() does not allow for the return of an error if the HTMLDocument does not have an HTMLHeadElement child. 4DOM, in these cases, will automatically add in needed elements in order to strictly follow the DOM interface spec. The methods for which 4DOM provides automatic document completion are:
HTMLDocument::getDocumentElement()HTMLDocument::setTitle()HTMLDocument::getBody()HTMLDocument::setBody()HTMLTableElement::insertRow()HTMLTableRowElement::insertCell()
See 4DOM Extensions for documentation of proprietary extensions and helper functions provided by 4DOM.
For release notes and news, see http://4Suite.org
The 4Suite users and support mailing list can be subscribed to, and archives viewed at http://lists.fourthought.com/mailman/listinfo/4suite
4Suite developers monitor the above list, and prefer for support to come thereby, but you can also contact them directly at support@4suite.org with questions and comments. You might also post messages to or check the archives of the Python xml-sig mailing list.