by CiaranMcNulty 225 days ago

by Cthulhu_ 224 days ago

It's been 84 years but I still miss some of the "basics" of XML in JSON - a proper standards organization, for one. But things like schemas were (or, felt like) so much better defined in XML land, and it took nearly a decade for JSON land to catch up.

Last thing I really did with XML was a technology called EXI, a transfer method that converted an XML document into a compressed binary data stream. Because translating a data structure to ASCII, compressing it, sending it over HTTP etc and doing the same thing in reverse is a bit silly. At this point protobuf and co are more popular, but imagine if XML stayed around. It's all compatible standards working with each other (in my idealized mind), whereas there's a hard barrier between e.g. protobuf/grpc and JSON APIs. Possibly for the better?

by bokchoi 224 days ago

I just learned about EXI, as it's being used on a project I work on. It's amazingly fast and small! It is a binary representation of the XML stream, and it can compress quite small if you have an XML Schema to go with your XML.

I was curious about how it is implemented and I found the spec easy to read and quite elegant: https://www.w3.org/TR/exi/

by sumtechguy 224 days ago

That data transform thing XSLT could do was so cool. You could twist it into emitting just about any other format, with XML as the top layer. You want it in tab-delimited YAML? Feed it the right style sheet and there you go. Other system wants CSV? Sure thing: different style sheet and there you go.

For a transport tech XML was OK. It just wasted 20% of your bandwidth on being a text encoding. Plus, wrapping your head around those style sheets was a mind twister. I'm not surprised people despise it, as it has the ability to be wickedly complex for no real reason.
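
The "one source, different style sheet per output format" idea can be sketched without XSLT. The following is plain Python, not a stylesheet, since the standard library has no XSLT processor. The `orders` data and the `rows` and `render` helpers are hypothetical names made up for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical source document; the element names are assumptions.
SOURCE = (
    "<orders>"
    "<order><id>1</id><item>pen</item></order>"
    "<order><id>2</id><item>ink</item></order>"
    "</orders>"
)

def rows(root):
    """Yield one flat row per <order>, independent of the output format."""
    for order in root.findall("order"):
        yield [order.findtext("id"), order.findtext("item")]

def render(root, delimiter):
    """Play the role of a 'style sheet': same rows, different delimiter."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter=delimiter, lineterminator="\n")
    writer.writerow(["id", "item"])
    writer.writerows(rows(root))
    return buf.getvalue()

root = ET.fromstring(SOURCE)
as_csv = render(root, ",")   # comma-separated output
as_tsv = render(root, "\t")  # tab-separated output from the same source
```

The source document is parsed once and each consumer gets its own rendering, which is roughly the role the style sheet plays in the XSLT version.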

by chrisweekly 224 days ago

84 years? nope.

by rwmj 224 days ago

XML is fine. A bit wordy, but I appreciate its precision and expressiveness compared to YAML.

XPath is kind of fine. It's hard to remember all the syntax but I can usually get there with a bit of experimentation.

XSLT is absolutely insane nonsense and needs to die in a fire.

by cturner 224 days ago

It depends what you use it for. I worked on an interbank messaging platform that normalised everything into a series of standard XML formats, and then used XSLT for representing data to the client. A common use case: we could re-render data into whatever a receiver's risk system was expecting, in config (not compiled code). You could have people trained in XSLT doing that; they did not need to be more experienced developers. Fixes were fast. It was good for this. Another time I worked on a production pipeline for a publisher of education books. Again, the data was stored in normalised XML. XSLT is well suited to mangling in that scenario.

by tclancy 224 days ago

That's funny, I would reverse those. I loved XSLT, though it took a long time to click; it was my gateway drug to concepts like functional programming and idempotency. XPath is pretty great too. The problem was XML, but it isn't inherent to it -- it empowered (for good and bad) lots of people who had never heard of data normalization to publish data, and some of it was good but, like Irish Alzheimer's, we only remember the bad ones.

by kllrnohj 224 days ago

The game Rimworld stores all its game configuration data in XML and uses XPath for modding and it's so incredibly good. It's a seriously underrated combination for enabling relatively stable local modifications of data. I don't know of any other game that does this, probably because XML has a reputation of being "obsolete" or whatever. But it's just such a robust system for this use case.

https://rimworldwiki.com/wiki/Modding_Tutorials/PatchOperati...
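
To illustrate the general idea (this is not RimWorld's actual patch API): a mod can ship a short list of XPath-addressed edits instead of a modified copy of the whole file. This is a minimal Python sketch. The `ThingDef`/`defName`/`price` data, the patch tuple format, and the `apply_patches` helper are all hypothetical, and ElementTree supports only a subset of XPath.

```python
import xml.etree.ElementTree as ET

# Hypothetical base game data; element names are assumptions.
BASE = """
<Defs>
  <ThingDef><defName>Steel</defName><price>2</price></ThingDef>
  <ThingDef><defName>Wood</defName><price>1</price></ThingDef>
</Defs>
"""

# Hypothetical patch format: (xpath selecting the nodes, replacement text).
PATCHES = [
    ("./ThingDef[defName='Steel']/price", "5"),
]

def apply_patches(root, patches):
    """Apply each patch in order; fail loudly if a selector matches nothing."""
    for xpath, new_text in patches:
        matches = root.findall(xpath)
        if not matches:
            raise LookupError(f"patch matched nothing: {xpath}")
        for node in matches:
            node.text = new_text
    return root

root = apply_patches(ET.fromstring(BASE), PATCHES)
prices = {d.findtext("defName"): d.findtext("price")
          for d in root.findall("ThingDef")}
```

Because the patch addresses a node by content (`defName='Steel'`) rather than by position, it keeps working when unrelated entries are added or reordered, which is presumably what makes such modifications relatively stable.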

by tannhaeuser 224 days ago

> bloat of '00s enterprise XML

True, and it's even more sad that XML was originally just intended as a simplified subset of SGML (HTML's meta syntax with tag inference and other shortforms) for delivery of markup on the web and to evolve markup vocabularies and capabilities of browsers (of which only SVG and MathML made it). But when the web hype took over, W3C (MS) came up with SOAP, WS-this and WS-that, and a number of programming languages based on XML including XSLT (don't tell HNers it was originally Scheme but absolutely had to be XML just like JavaScript had to be named after Java; such was the madness).

by codeulike 225 days ago

XPath would have been nice if you didn't have to pedantically namespace every bit of every query.

by masklinn 224 days ago

That… has nothing to do with XPath?

If your document has namespaces, XPath has to reflect that. You can either tank it or explicitly ignore namespaces by forgoing the shorthands and checking `local-name()`.

by codeulike 224 days ago

Ok. Perhaps 'namespace the query' wasn't quite the right way of explaining it. All I'm saying is, whenever I've used XPath, instead of it looking nice like

/*bookstore/*book/*title

it's been some godawful mess like

/*[name()='bookstore']/*[name()='book']/*[name()='title']

... I guess because they couldn't bear to have it just match on tags as they are in the file, and it had to be tethered to some namespace stuff that most people don't bother with. A lot of XML is ad-hoc without a namespace defined anywhere.

It's like

Me: Hello XPath, here's an XML document, please find all the bookstore/book/title tags

Xpath: *gasps* Sir, I couldn't possibly look for those tags unless you tell me which namespace we are in. Are you some sort of deviant?

Me: oh ffs *googles xpath name() syntax*

by masklinn 224 days ago

> the tags as they are in the file

That is not actually relevant, and it is not information the average XML processor even receives. If the file uses a default namespace (xmlns), then the elements are namespaced, and anything processing the XML has to either properly handle namespaces or explicitly ignore them.

> A lot of XML is ad-hoc without a namespace defined anywhere

If the element is not namespaced, XPath does not require a prefix; you just write

    //bookstore/book/title
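
The difference a default namespace makes can be checked with a small sketch using Python's `xml.etree.ElementTree`, which implements only a subset of XPath. The `urn:example:books` namespace URI and the `b` prefix are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Same content twice: once with no namespace, once with a default namespace.
PLAIN = "<bookstore><book><title>Dune</title></book></bookstore>"
DEFAULT_NS = (
    '<bookstore xmlns="urn:example:books">'
    "<book><title>Dune</title></book></bookstore>"
)

plain_root = ET.fromstring(PLAIN)
ns_root = ET.fromstring(DEFAULT_NS)

# No namespace in the document: a plain path is enough.
plain_titles = [t.text for t in plain_root.findall("./book/title")]

# Default namespace in the document: the same plain path matches nothing,
# because the elements are really named {urn:example:books}book and so on.
missed = ns_root.findall("./book/title")

# Binding a prefix (any prefix) to the same URI makes the query match.
ns = {"b": "urn:example:books"}
ns_titles = [t.text for t in ns_root.findall("./b:book/b:title", ns)]
```

The prefix used in the query does not need to match anything in the file; only the namespace URI it is bound to matters.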

by ndriscoll 224 days ago

I don't recall ever needing to do that for unnamespaced tags. Are you sure the issue you're having isn't that the tags have a namespace?

my:book is a different thing from your:book and you generally don't want to accidentally match on both. Keeping them separate is the entire point of namespaces. Same as in any programming language.

by rhdunn 224 days ago

Newer versions of XPath and XSLT allow

    /*:bookstore/*:book/*:title
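
Python's standard-library ElementTree does not implement those newer XPath versions, but since Python 3.8 it has a comparable `{*}name` wildcard. This sketch assumes made-up `urn:example:mine` and `urn:example:yours` namespaces and contrasts the wildcard with a query bound to a single namespace.

```python
import xml.etree.ElementTree as ET

# Two vocabularies that both define "book" and "title".
DOC = """
<bookstore xmlns:my="urn:example:mine" xmlns:your="urn:example:yours">
  <my:book><my:title>Mine</my:title></my:book>
  <your:book><your:title>Yours</your:title></your:book>
</bookstore>
"""

root = ET.fromstring(DOC)

# {*}name matches that local name in any namespace (or in none).
any_ns = [t.text for t in root.findall("./{*}book/{*}title")]

# A prefix bound to one URI matches only that vocabulary.
only_mine = [
    t.text
    for t in root.findall("./m:book/m:title", {"m": "urn:example:mine"})
]
```

The wildcard is convenient for ad-hoc queries, but it also matches both vocabularies at once, which is exactly what namespaces exist to prevent.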

by somat 224 days ago

Can confirm. Working programmatically with XML is not really that bad: there is a well-formed query syntax (XPath), and the DOM API just works.

Until some joker decides to employ XML namespaces; then everything turns ugly real fast. I am not sure I can articulate why it is so unpleasant, something about how everything gets super verbose and the API now needs all sorts of extra state.

by tootie 224 days ago

I never enjoyed XSLT. It always felt like a square peg for a round hole. I do miss XML though. It had so, so many power features that too few people knew how to use. XSD was incredibly good for domain modeling. It had an include system for composing files. And nobody really made good use of mixed content, but it was a crazy powerful feature. You could embed structured content in unstructured content inside structured content.
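
Mixed content can be seen in a small sketch with Python's `xml.etree.ElementTree`. The `review`/`body`/`book` vocabulary is hypothetical. The same fragment can be read either as running prose or as structured fields.

```python
import xml.etree.ElementTree as ET

# Structured <book> data embedded in free text, inside a structured <review>.
DOC = (
    "<review><body>I read "
    "<book><title>Dune</title> by <author>Herbert</author></book>"
    " twice.</body></review>"
)

root = ET.fromstring(DOC)
body = root.find("body")

# Read it as prose: itertext() walks text and tails in document order.
plain_text = "".join(body.itertext())

# Read it as data: the embedded element is still addressable by name.
book = body.find("book")
structured = {
    "title": book.findtext("title"),
    "author": book.findtext("author"),
}
```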

by int_19h 224 days ago

The original idea was good: having a purely declarative language running on the client which just does the model -> view transformation, and having the server serve the models. XSLT as an implementation of that idea is pretty bad, but mostly because using XML as the underlying syntax for a PL is very unergonomic. If the initial version of XSLT looked more like XQuery does, I think it would have been a lot more popular.

by tootie 224 days ago

Yeah, the idea makes sense. More sense than CSS, which ended up requiring years and years of refinement and pre-processors to be usable.

by aitchnyu 224 days ago

In the 2003 book The Art of Unix Programming, the author advocated bespoke text formats and writing parsers for them. Writing XML by hand is on his list of war crimes. Since then, syntax highlighting, autocomplete, and autoformatting have narrowed the effort gap, and tolerant parsers (browsers being the main example) have gotten a bad rap. Would Markdown and YAML exist with modern editors?

by maxloh 224 days ago

However, XML is actually a worse format to transfer over the internet. It's bloated and consumes more bandwidth.

by JimDabell 224 days ago

XML is a great format for what it’s intended for.

XML is a markup language system. You typically have a document, and various parts of it can be marked up with metadata, to an arbitrary degree.

JSON is a data format. You typically have a fixed schema and things are located within it at known positions.

Both of these have use-cases where they are better than the other. For something like a web page, you want a markup language that you progressively render by stepping through the byte stream. For something like a config file, you want a data format where you can look up specific keys.

Generally speaking, if you’re thinking about parsing something by streaming its contents and reacting to what you see, that’s the kind of application where XML fits. But if you’re thinking about parsing something by loading it into memory and looking up keys, then that’s the kind of application where JSON fits.
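
The two parsing styles can be put side by side in a minimal sketch, using only the Python standard library and made-up sample documents: reacting to a stream of XML events versus loading JSON and looking up a key.

```python
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical inputs for illustration.
XML_DOC = b"<doc><p>Hello <b>world</b></p><p>Bye</p></doc>"
JSON_DOC = '{"server": {"host": "localhost", "port": 8080}}'

# Streaming style: iterparse yields events as the parser advances through
# the input, so a consumer can react before the document is complete.
events = []
for event, elem in ET.iterparse(io.BytesIO(XML_DOC), events=("start", "end")):
    events.append((event, elem.tag))

# Lookup style: parse the whole thing, then index by known keys.
config = json.loads(JSON_DOC)
port = config["server"]["port"]
```

Either format can be handled in either style (streaming JSON parsers and in-memory XML trees both exist); the sketch only shows the style each one most naturally invites.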

by rwmj 224 days ago

Only if you never use compression.
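
One way to check this on your own data is to serialize the same records both ways and compare raw and gzip-compressed sizes. This is a rough sketch with synthetic, highly repetitive records, so the numbers it prints say little about real payloads.

```python
import gzip
import json

# Synthetic records; field names and values are made up for illustration.
records = [
    {"id": i, "title": f"Title {i}", "price": i + 0.5} for i in range(500)
]

# Hand-rolled XML serialization (values here need no escaping).
xml_raw = (
    "<books>"
    + "".join(
        f"<book><id>{r['id']}</id><title>{r['title']}</title>"
        f"<price>{r['price']}</price></book>"
        for r in records
    )
    + "</books>"
).encode("utf-8")

# Compact JSON serialization of the same records.
json_raw = json.dumps(records, separators=(",", ":")).encode("utf-8")

xml_gz = gzip.compress(xml_raw)
json_gz = gzip.compress(json_raw)

print(f"raw:  xml={len(xml_raw)}  json={len(json_raw)}")
print(f"gzip: xml={len(xml_gz)}  json={len(json_gz)}")
```

Repeated tag names are exactly the kind of redundancy a general-purpose compressor removes, which is the point being made; how much of the gap closes depends on the data.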

by bokchoi 224 days ago

Check out EXI. It compresses the XML stream into a binary encoding and is quite small and fast:

https://www.w3.org/TR/exi/
