XML patterns

List overview All Threads
Download

newer

older

Squeak Chat Friday July 22nd 05:00...

On AppRegistry and its use

stéphane ducasse

18 Jun 2005 18 Jun '05

9:23 p.m.

serializing a simple object in XML is simple. Now I was wondering if there are good practices to serialize and reload composed objects.

For example: if I have an attribute containing a date, should I do the readFromString by hand as a postaction?

if I have an attribute containing a collection, should I recreate the collection after loading?

Are there mechanism where we can describe what to do for each of the document tags? May be I'm dreaming but this sounds so obvious.

Stef

Show replies by date

Hans N Beck

19 Jun 19 Jun

11:11 a.m.

Hi Stef,

Am 18.06.2005 um 21:23 schrieb stéphane ducasse:

...

hi

serializing a simple object in XML is simple. Now I was wondering if there are good practices to serialize and reload composed objects.

For example: if I have an attribute containing a date, should I do the readFromString by hand as a postaction?
if I have an attribute containing a collection, should I  
recreate the collection after loading?

I'm not shure what this example should show. Do you mean to have patterns for serializing object to XML in Squeak or in general ?

But one remark I would add: in the first step, the tree build from the XML stream should first reflect the XML structure, that means you can access a node having text node child, attribute node child and so on. The values are given as strings, but with type infos. From this the original object is created, by using this XML nodes and additional informations like culture info, which allows to interpret the strings in the value child nodes correct (date, time, currency, seperators etc).

So is your example above positioned in the first step (XML Node) or second (object creation) ?

Greetings

Hans

stéphane ducasse

12:16 p.m.

On 19 juin 05, at 11:11, Hans N Beck wrote:

...

Hi Stef,

Am 18.06.2005 um 21:23 schrieb stéphane ducasse:

...
hi

serializing a simple object in XML is simple. Now I was wondering if there are good practices to serialize and reload composed objects.

For example: if I have an attribute containing a date, should I do the readFromString by hand as a postaction?
if I have an attribute containing a collection, should I  
recreate the collection after loading?
I'm not shure what this example should show. Do you mean to have patterns for serializing object to XML in Squeak or in general ?

But one remark I would add: in the first step, the tree build from the XML stream should first reflect the XML structure, that means you can access a node having text node child, attribute node child and so on. The values are given as strings, but with type infos. From this the original object is created, by using this XML nodes and additional informations like culture info, which allows to interpret the strings in the value child nodes correct (date, time, currency, seperators etc).

So is your example above positioned in the first step (XML Node) or second (object creation) ?

I have an object that I want to serialize in XML now this object contains a date object of course I can save it explicitly as a string and do a readFrom.... But I wanted to know if there is a more advance serialization process.

Yar Hwee Boon

12:26 p.m.

On 19-Jun-05, at PM 06:16, stéphane ducasse wrote:

...

I have an object that I want to serialize in XML now this object contains a date object of course I can save it explicitly as a string and do a readFrom.... But I wanted to know if there is a more advance serialization process.

I'm not sure if I understand you correctly, are you looking for a library for object serialisation to XML? I remember there is at least 1 of those on SM. Or you can just rip the serialization code out from the Squeak XML/RPC client?

-- HweeBoon http://motionobj.com/blog/

Hans N Beck

11:05 p.m.

Hi,

...

I have an object that I want to serialize in XML now this object contains a date object of course I can save it explicitly as a string and do a readFrom.... But I wanted to know if there is a more advance serialization process.

What means "more advance" ? In the last consequence, you always must read/write strings ? I could imagine that it would perhaps an advantage if you could express serialization in mappings of objects, something like f: A -> B where A is the set of syntactical possible objects in Language A and B is a set of allowed objects in XML DOM (-

...

the nodes), so you can say the object "collection" maps to

<collection><item><itemClass>...</ itemClass><itemValue>...<itemValue></item>...</collection>. So you have a API level in the mid (like .Net) Or completly different ?

Greetings

Hans

Lex Spoon

4 Jul 4 Jul

2:30 a.m.

...

I have an object that I want to serialize in XML now this object contains a date object of course I can save it explicitly as a string and do a readFrom.... But I wanted to know if there is a more advance serialization process.

My usual pattern for this is to put xmlForExport methods on every class I want to be able to save, and then have decodeXML: methods on the class side of these same classes. Finally, I add a master decodeXML: method somewhere in my system that looks at the tag and then calls decodeXML: on the proper class. At first, the master decodeXML: method has a long list of ifTrue:'s in it to select the class to restore, but if the system lasts long enough I'll end up making a Dictionary that maps XML tag names into class objects.

Actually, usually I don't use XML per se if I am just doing Squeak work. I convert to trees of arrays, analagous to s-expressions, and I don't convert small objects such as integers, floats, and points -- classes that are extremely unlikely to change for the duration of time that my external files should last. I then export and import the array trees using SmartRefStream. This is Squeak-specific, but it is a little more convenient and it does not cause problems with cross-version compatibility.

There is a meme going around that you should use an automatic serializer for this kind of thing, but if you do so then in practice you sacrifice cross-version compatibility. That is, if you use an automatic serializer -- be it SIXX, SmartRefStream, or ImageSegment -- then files you create today will probably not be loadable after your software has evolved for several months. The problem is that you will tend to refactor your code as it evolves, creating new classes and eradicating old ones, and thus the serialized files will specify classes that no longer exist.

There is substantial room for improvement in this pattern, but it is somewhere to start. I yearn for the day that serialization is treated more seriously, the day when software folks lose their fanatacism with low-level file formats (XML) and silver bullets (automatic serialization).

-Lex

Mikael Kindborg

15 Jul 15 Jul

8:22 a.m.

Just want to mention the Universal Binary Format (UBF) suggested by Joe Armstrong (one of the poeple behind Erlang). It is meant to be an alternative to XML. Here is a pointer: http://www.sics.se/~joe/ubf/site/home.html Best, Micke

On 7/4/05, Lex Spoon lex@cc.gatech.edu wrote:

...

...
I have an object that I want to serialize in XML now this object contains a date object of course I can save it explicitly as a string and do a readFrom.... But I wanted to know if there is a more advance serialization process.

My usual pattern for this is to put xmlForExport methods on every class I want to be able to save, and then have decodeXML: methods on the class side of these same classes. Finally, I add a master decodeXML: method somewhere in my system that looks at the tag and then calls decodeXML: on the proper class. At first, the master decodeXML: method has a long list of ifTrue:'s in it to select the class to restore, but if the system lasts long enough I'll end up making a Dictionary that maps XML tag names into class objects.

Actually, usually I don't use XML per se if I am just doing Squeak work. I convert to trees of arrays, analagous to s-expressions, and I don't convert small objects such as integers, floats, and points -- classes that are extremely unlikely to change for the duration of time that my external files should last. I then export and import the array trees using SmartRefStream. This is Squeak-specific, but it is a little more convenient and it does not cause problems with cross-version compatibility.

There is a meme going around that you should use an automatic serializer for this kind of thing, but if you do so then in practice you sacrifice cross-version compatibility. That is, if you use an automatic serializer -- be it SIXX, SmartRefStream, or ImageSegment -- then files you create today will probably not be loadable after your software has evolved for several months. The problem is that you will tend to refactor your code as it evolves, creating new classes and eradicating old ones, and thus the serialized files will specify classes that no longer exist.

There is substantial room for improvement in this pattern, but it is somewhere to start. I yearn for the day that serialization is treated more seriously, the day when software folks lose their fanatacism with low-level file formats (XML) and silver bullets (automatic serialization).

-Lex

Brenda

17 Jul 17 Jul

11:52 p.m.

Lex Spoon wrote:

...

...
I have an object that I want to serialize in XML now this object contains a date object of course I can save it explicitly as a string and do a readFrom.... But I wanted to know if there is a more advance serialization process.

My usual pattern for this is to put xmlForExport methods on every class I want to be able to save, and then have decodeXML: methods on the class side of these same classes.

I was doing something similar to this, but my application requires that I be able to export the same objects to multiple different XML formats, many of which have identically named elements with different internal structure. The structure of the nodes in any given export format does not correspond particularly well to that of my native objects. I need to be able to add new export formats quickly and easily.

To solve my problem, I have partially implemented a library whose main class is called XSDClassFactory. It takes an XML Schema Definition. If you don't already have one, you can construct an XSD from one or several XML files using publically available tools. XSDClassFactory reads the XSD and creates classes with the corresponding structure in Squeak. The classes know how to read, write & validate XML which adheres to the corresponding XSDs, and you can access the data using instance variable accessors instead of the usual YAXO methods, which means code that uses the generated classes is a lot clearer. I am trying to get this library finished up & publically released in the next month or two; I've been lazy & haven't implemented all the standard/built-in XML types yet. I think a couple of pre-alpha testers would be helpful at this point; contact me off-list if you'd like to be one.

Here is part of my XSDType class comment about how to do mappings with XSDClassFactory classes. Alpha, not fully tested, may change, blah blah blah, but it is a pattern to use as a straw man.

---- * You will need to manually define mapping methods to make this work for new subclasses. I wrote XSDClassFactory to make it easy for me to teach an application new XML formats, so all the mappings in my system are between my native types (SourceClass) and XSDType (TargetClass) subclasses, which I treat as a static output (and occasionally input) format. Here is what I am doing:

- Use the naming convention TargetClass>>fromSourceClass: for all conversion methods, in both directions. This naming convention is preferred to SourceClass>>asTargetClass because the protocol of the SourceClass and the TargetClass are typically different. - At the top of the SourceClass hierarchy, define a method called SourceClass>>convertToFormat: which takes a TargetClass, finds the most specific fromSourceClass: message that particular TargetClass will understand, and passes back its answer. - In SourceClass, define a registry containing interesting TargetClasses. - Construct the save as.. menu from the registry. - To output to a new XML format, run an XSD through XSDClassFactory, add the resulting TargetClasses to the appropriate SourceClass registries, and write some fromSourceClass: methods in each TargetClass. ----

Note that the above does not solve the problem Lex was solving with the case statement & future dictionary, wherein the system can automatically instantiate the correct classes based on the XML received. Solving that problem using only XML element names will inherently suffer from XML namespace collision issues. I think there are sometimes cues in the attributes (maybe explicit namespace declaration?), but I haven't looked into it.

Brenda

6871

Age (days ago)

6900

Last active (days ago)

squeak-dev@lists.squeakfoundation.org

7 comments

6 participants

tags (0)

participants (6)

Brenda
Hans N Beck
Lex Spoon
Mikael Kindborg
stéphane ducasse
Yar Hwee Boon