Author Archives: admin

New Laptop, New Linux Distro

Having returned to running my own company, I decided it was time to retire my 5-yo MacBook Pro and get something modern to run Linux in. After careful consideration I decided on a Dell Precision M3800 since it’s actually being sold by Dell with Ubuntu 14.04 pre-installed. The M3800 is thin and light, in spite of the 15.6″ screen, robustly built and includes a 4k screen. Simply put, it is gorgeous.

I didn’t order the Ubuntu version, though, partly since I actually need Windows every now and then, but mostly because there’s the “free” Windows 10 upgrade once it becomes available, and I’m curious. Instead, I added a second hard disk for the Linux install. The extra disk can be fitted if opting for a smaller battery, and the installation didn’t void the warranty, since Dell actually accepts that people will want to tinker with their machines (beat that, Apple!).

After careful consideration, a few live USB sticks and one test install of Ubuntu, I have now set up Linux Mint 17.2 as my primary OS. It handles the HiDPI 4k screen beautifully, except for some older apps with hard-coded font sizes and such (shame on you, Skype!) and most Java-based programmes I have tried so far. oXygen is pretty much the only Java app I really need, so for now I’ve doubled every font size in the preferences, which makes oXygen usable. The toolbars are still tiny, but I am now able to work.

All in all, I’m really pleased.

John Nash Killed in Crash

Leave a reply

The American mathematician and Nobel Prize for Economics winner John Nash was killed in a car crash yesterday. Most people probably know the name from Ron Howard’s 2001 film A Beautiful Mind, but those of us with an interest in mathematics are more likely to remember him as one of the foremost minds in game theory.

Today is a sad day.

Mr Smith Goes to Washington

Leave a reply

My paper submission to this year’s Balisage conference was accepted. It’s about an eXist implementation I did for the Swedish Federation of Farmers (LRF), and while I may not be completely objective, I think the system is very cool. From the conference blurb:

The Federation of Swedish Farmers (LRF) provides its 170,000 members with a web-based service to check compliance with state and EU farming regulations. These checklists are also produced nightly both as generic checklists with more than 130 pages and as individualised checklists for registered members. The system consists of an eXist database coupled with oXygen Author. The checklists and their related contents are edited, stored, and processed, published as PDFs, and exported to the SQL database which stores member registration, feeds the website, and does various other tasks. The system uses XQuery, XSLT, XInclude modularization, an extended XLink linkbase, and other markup technologies. It currently handles more than 40,000 PDF documents a year and many more than that in the web-based forms.

This is the second version of the LRF system. The first, presented at XML Prague in 2013, was XProc-based and represented my somewhat naive trust in the state of XProc in eXist, The new one I rewrote in XQuery, having tested (and failed miserably at using) the XProc module that is now available. XProc in eXist, sadly, is not yet ready for prime time.

Be as it may, I’m really pleased about both the system and my paper. and hope to see you there.

Home

Leave a reply

Miss it.

LinkedIn Spam

Leave a reply

LinkedIn seems to be testing new ways to make money after their recent revenue forecast cuts. My inbox had a “sponsored message”, something I don’t recall having seen before.

Sorry, LinkedIn, but you’ll have to come up with something else. I’d rather delete my account.

In the UK

Leave a reply

I’m currently in the UK, spending a month on site to get to know my new client, LexisNexis UK. Legal publishing is a new field for me, I must confess, but they are using pretty much every XML technology there is and I’m a bit like a 5-yo in a toy store. It doesn’t hurt that the people I’m working with are both nice and knowledgeable, either.

oXygen XML Editor

Leave a reply

My friends at Syncro Soft, the makers of oXygen XML Editor, very kindly provided me with an oXygen license to replace the one I used while at Condesign. As oXygen is my tool of choice and the one I use daily, as necessary for me as a C compiler is for Linux kernel programmers, I remain in absolute awe of both the product and the kind and generous human beings that make it.

Consider this post as my heartfelt thank you.

Changes

Leave a reply

I’ll be going back to running my own company, Creative Words, again. In short, I got an offer that I simply couldn’t refuse (no, not that kind of offer), and so here I am. It is exciting and fun, but also a little scary.

I’ll update this page when it’s time.

The Uniqueness of Things

Leave a reply

Found the below in my Drafts folder, unearthed after I imported my old blog to the WordPress instance on my own server. While it was written six years ago, I thought it was still worth publishing after I read it. I hope you think so too.

Two years after writing this (and having long since forgotten that I did), I presented the concepts behind URNs and the need for uniqueness in document management at XML Finland. The system was finished and done, and I was proud of it. It wasn’t perfect but it was battle-tested and we knew about its weaknesses. I really wanted to talk about it with other markup people, colleagues who knew about angled brackets, and I was sure they’d understand. In fact, I feared some might say they implemented it all years ago, only better. Yet, what is described here also happened at XML Finland; the importance of uniqueness and the advantages of semantic naming using URNs went right past them, judging by the Q&A afterwards.

Or maybe it’s just that I’m wrong.

Anyway, here goes…

===

I’ve been busy finalising an authoring system that is supposed to identify every resource ever stored in it with URNs. What follows is just a rant, but I do think about it and would like to know the why’s and the how’s. I would like to know why the concept of uniqueness is so difficult to understand.

A URN, of course, is the unique name of a document, as opposed to its location, the URL. Compare with a book in a library. Sometimes books get reorganised in a library, meaning that they will be put on another shelf (another address), but the name will remain the same. The name is unique while the address is not. When identifying content to be reused, this is the principle you need to honour.

Anyway…

It’s been my primary concern all along to ensure that everything is identified with a URN. Everything. If you create a document and link to another, meaning to insert that other document in the one you’re editing, the link should take the form URN#id, where the hash separates the name of the document from a node pointed out within the document when checked into the database. When checked out, in the XML editor, however, the form should be URL#id, since URLs are what most authoring systems can handle; we need the URL for styling the document in the editor, to publish it, and to process it in various ways.

A URN is possible, of course, but it needs to be replaced with a URL when processing, one way or another, so the decision was to use a URL when a resource has been checked out and replace it with a URN when checked in.

Early on, we did make a demo application that opened a document containing URNs pointing to other documents, replaced them with the corresponding URLs, normalised the resulting document, and published it using XSL and FOP. It worked like a charm.

Today, I found that the check-in does not replace the URLs with URNs. The file name is a pseudo-URN (with colons replaced by underscores) so I know my URN scheme is being used, but that’s as far as it goes. The URN-like file names remain.

Talking to a developer, I realised that he hadn’t even thought about it. He was using URNs to identify the resources in the database (the URN being an attribute on the object) but in spite of all our planning, all of our tests, the URLs were left in the links when the document containing them had been checked in. The object IDs in the database are unique, he said, but yes (he admitted), the file names are being used in the database so we can’t store two identically named files in the same folder in the database.

This is not a major problem since we already have the code to do all the work, but what surprises me is that nobody made the connection. Me, I assumed everyone had understood but did not check. I simply assumed that following the test, following the discussions, following the months of development, no-one could fail to understand their true meaning.

Wrong.

What is it that makes the concept of URNs so difficult?

Tommy Emmanuel in Concert

Leave a reply

I went to see Tommy Emmanuel do all kinds of things to and with a guitar at the GÃ¶teborg Concert Hall last Sunday. It was my second Tommy Emmanuel concert, and I have to say it was even better than the first, in December 2012.

I think everybody should attend at least one Tommy Emmanuel concert in their lifetimes.

sgmlguru.org

on markup, film projection and more!

Author Archives: admin

New Laptop, New Linux Distro

John Nash Killed in Crash

Mr Smith Goes to Washington

Home

LinkedIn Spam

In the UK

oXygen XML Editor

Changes

The Uniqueness of Things

Tommy Emmanuel in Concert