JabChapter 9

From WikiContent

Revision as of 23:00, 22 May 2009 by RacgeTcaze (Talk | contribs)
Jump to: navigation, search



Groupchat, : Components, and Event Models

By now, you should have a

good idea of how scripts interact with Jabber and how the core
elements such as <message/> and
<presence/> can be constructed and handled.

This chapter builds upon what we've already seen in Chapter 8 and introduces new concepts. We build a nosy assistant that joins a conference room and alerts us to words and phrases that we want it to listen for. There are two popular conference protocols, as mentioned in Section 6.2.6—the presence-based Groupchat protocol, and the jabber:iq:conference-based Conference protocol. The assistant recipe, a foray into the world of 'bots, takes a look at the original presence-based one.

As we've seen, programming within Jabber's event model is fairly straightforward. But what happens when you want to meld other components with event models of their own? We look at a couple of typical scenarios where this melding needs to happen. The first is a homage to the Trojan Room Coffee Machine (http://www.cl.cam.ac.uk/coffee/coffee.html), where we give life, or at least presence, to a coffeepot, using LEGO® MINDSTORMS™. The second is a Tk-based RSS headline viewer. Both the coffeepot and the Tk programming library have event loops of their own. With the coffeepot, we need to have a loop that polls the coffeepot's status, independently of the polling for incoming packets from the Jabber server. The Tk programming library's event model is similar to those of the Jabber programming libraries that are used in the recipes in this book, in that handlers are set up and a loop is started that listens for UI events. In both cases, we need to get these event loops to work in harmony with the Jabber libraries' event loops.

Building Jabber solutions without event loops is a Sisyphean task. The very nature of Jabber communication is event-based, and it's important to understand how to use the event features of the Jabber programming libraries and also how to be able to mesh those features with similar features in other libraries and systems.

We also look at extending messages and build a mechanism that delivers RSS headlines to clients who register with that mechanism. These headlines are carried using an extended message type. In fact, the RSS Delivery Mechanism is a component. The three recipes in Chapter 8, were Jabber clients, in that they connected to the Jabber network via the Jabber Session Manager (JSM) service. We look at the differences between programming a client and programming a component in this chapter and build a complete component that can be queried and interacted with using the third of Jabber's building blocks—the <iq/> element.

Happy coding!

Keyword Assistant

Many of the Jabber core and peripheral developers hang out in a conference room called jdev hosted by the Conferencing component on the Jabber server running at jabber.org. While a lot of useful information is to be gleaned from listening to what goes on in jdev, it isn't possible to be there all the time. Conversations in jdev are logged to web pages, which can be used to visit after the fact to try to catch up with things; however, this can be a hopeless task. One solution is to build a 'bot that looks for specific keywords and Uniform Resource Locators (URLs) in the conversations in jdev and send those on as Jabber messages.

This script, keyassist, connects to a Jabber server, enters a conference room, and listens to the conversations, looking for certain words and phrases to be uttered. The keyassist script is given a bit of "intelligence" in that it can be interacted with and told, while running, to watch for (or stop watching for), certain words and phrases.

The keyassist script introduces us to programmatic interaction with the Conferencing component. Before looking at the script, however, let's have a brief overview of Conferencing in general.


The Conferencing component at jabber.org is conference.jabber.org. Details of the component instance configuration for such a Conferencing component can be found in Section 4.10, where we see that the component exists as a shared object library connected with the library load component connection method. This component provides general conferencing facilities, oriented around a conference room and conference user model.

A Jabber user can enter (or join) a conference room, thereby becoming a conference user identified by a nickname that is chosen upon entering that room. Nicknames are generally used in conference rooms to provide a modicum of privacy—it is assumed that by default you don't want to let the other conference room members know your real JID.

The Conferencing component supports two protocols for user and room interaction: a simple one that provides basic features and a more complex one that provides the basic features plus facilities such as password-protected rooms and room descriptions—Groupchat and Conference.

{{Note|There is a third protocol, called Experimentaliq:groupchat, which came between the Groupchat and Conference protocols. This reflected an experimental move to add features to the basic Groupchat protocol using IQ elements, the contents of which were qualified by a namespace jabber:iq:groupchat. This protocol has been dropped, and support for it exists only in certain versions of WinJab and JIM.


The Groupchat protocol is the simpler of the two and provides basic
functions for entering and exiting conference rooms and choosing
nicknames. : This Groupchat protocol is known as the presence-based
protocol, because the protocol is based upon
<presence/> elements used for room entry, exit, and
nickname determination. The Groupchat protocol has a nominal version
number of 1.0.
The Conference protocol offers more advanced features than the
Groupchat protocol and makes use of two IQ namespaces:
jabber:iq:conference and jabber:iq:browse. It has a
nominal protocol version number of 1.4, which reflects the version of
the Jabber server with which it is delivered. Sometimes this version
number is referred to as 0.4, such as in the downloadable tarball and
in the value returned in response to a "version query" on the
component itself, as shown in Example 9-1. : The version number isn't
that important. The main thing to keep in mind is that the component
that is called conference.so (see the reference to the
shared object library in Section 4.10.4) supports both the
Groupchat protocol and the Conference protocol. If you come across a
shared object library called groupchat.so, this is the
original Conferencing component that was made available with Jabber
server Version 1.0. This library supports only the Groupchat protocol.

Querying the Conferencing component's version

SEND: <iq type='get' to='conference.gnu.mine.nu'> <query xmlns='jabber:iq:version'/> </iq>

RECV: <iq to='dj@gnu.mine.nu/jarl' from='conference.gnu.mine.nu' type='result'> <query xmlns='jabber:iq:version'> <name>conference</name> <version>0.4</version> <os>Linux 2.4.2-2</os> </query> </iq> In this recipe we'll be using the simpler Groupchat protocol. It's widely used and easy to understand. Example 9-2 shows a typical element log from Groupchat-based activity. It shows a user, with the JID qmacro@jabber.com, entering a room called "cellar," hosted on the conference component at conf.merlix.dyndns.org, a room that currently has two other occupants who go by the nicknames flash and roscoe. The elements are from qmacro's perspective, and are all explained following the example.

The Groupchat protocol in action The user qmacro tries to enter the conference room with the nickname flash and fails:

SEND: <presence to='cellar@conf.merlix.dyndns.org/flash'/>

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/flash' type='error'> <error code='409'>Conflict</error> </presence>

He tries again, this time with a different nickname, deejay, and is successful:

SEND: <presence to='cellar@conf.merlix.dyndns.org/deejay'/>

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/flash'/>

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/roscoe'/>

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/deejay'/>

RECV: <message to='qmacro@jabber.com/jarltk' type='groupchat' from='cellar@conf.merlix.dyndns.org'> <body>deejay has become available</body> </message>

roscoe says hi, and qmacro waves back:

RECV: <message to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/roscoe' type='groupchat' cnu=> <body>hi</body> </message>

SEND: <message to='cellar@conf.merlix.dyndns.org' type='groupchat'> <body>/me waves to everyone</body> </message>

flash sends a private message to qmacro:

RECV: <message to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/flash' type='chat'> <body>Is that you, qmacro?</body> <thread>jarl1998911094</thread> </message>

Feeling left out of the conversation, roscoe leaves the room:

RECV: <presence to='qmacro@jabber.com/jarltk' type='unavailable' from='cellar@conf.merlix.dyndns.org/roscoe'/>

RECV: <message to='qmacro@jabber.com/jarltk' type='groupchat' from='cellar@conf.merlix.dyndns.org'> <body>roscoe has left</body> </message> Let's take the stages in Example 9-2 one by one.

Failed attempt to enter room
qmacro makes an attempt to enter the room using the Groupchat
protocol. This is done by sending a directed
<presence/> element to a particular JID that represents
the room and the chosen nickname. This JID is constructed as follows:

[room name]@[conference component]/[nickname]

In this example, the conferencing component is identified with the
hostname conf.merlix.dyndns.org. qmacro's choice of
nickname is flash:


Thus the following element is sent:

SEND: <presence to='cellar@conf.merlix.dyndns.org/flash'/>

The conference component determines that there is already someone
present in the room cellar@conf.merlix.dyndns.org with the
nickname flash, so qmacro is notified of this and
receives a directed presence with an <error/> tag:

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/flash' type='error'> <error code='409'>Conflict</error> </presence>

Note that the <presence/> element has the type
error and comes from the artificial JID constructed in the
room entry attempt. The element is addressed to qmacro's real
JID, of course—qmacro@jabber.com/jarltk—as otherwise it
wouldn't reach him. : The error code 409 and text "Conflict" tells
qmacro that the nickname conflicted with one already in the
room. This is a standard error code/text pair; Table 5-3 shows a
complete set of code/text pairs. : At this stage, qmacro is
not yet in the room.
Successful attempt to enter room
qmacro tries again, this time with a different nickname,

SEND: <presence to='cellar@conf.merlix.dyndns.org/deejay'/>

This time, there is no conflict—no other user is in the room "cellar"
with that nickname—and the conference component registers the entry.
It does this by sending qmacro the presence of all the room
occupants, including that of himself:

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/flash'/>

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/roscoe'/>

RECV: <presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/deejay'/>

These presence elements are also sent to the other room occupants so
they know that deejay is present.
Conference component-generated notification
In addition to the presence elements sent for each room occupant, a
general roomwide message noting that someone with the nickname
deejay just entered the room is sent out by the component as
a type='groupchat' message to all the room occupants:

RECV: <message to='qmacro@jabber.com/jarltk' type='groupchat' from='cellar@conf.merlix.dyndns.org'> <body>deejay has become available</body> </message>

The text "has become available" used in the body of the message is
taken directly from the Action Notices definitions, part of the
Conferencing component instance configuration described in Section
4.10.3. Note that the identity of the room itself is simply a generic
version of the JID that the room occupants use to enter:


Roomwide chat
Once the user with the nickname roscoe sees someone enter the
room, he sends a greeting, and qmacro waves back:

RECV: <message to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/roscoe' type='groupchat' cnu=> <body>hi qmacro</body> </message>

SEND: <message to='cellar@conf.merlix.dyndns.org' type='groupchat'> <body>/me waves to everyone</body> </message>

As with the notification message, each message is a
groupchat-type message. The one received appears to come from
cellar@conf.merlix.dyndns.org/roscoe, which is the JID
representing the user in the room with the nickname roscoe.
This way, roscoe's real JID is never sent to qmacro.
The message deejay sends is addressed to the room's identity
cellar@conf.merlix.dyndns.org, and contains a message that
starts with /me. This is simply a convention that is
understood by clients that support conferencing, meant to represent an
action and displayed thus:

* deejay waves to everyone

{{Note|Ignore the cnu attribute; it's put there and used by the component and should never make it out to the client endpoints. The attribute name is a short name for the conference user and refers to the internal structure that represents a conference room occupant within the component.


One-on-one chat
The Conferencing component also supports a one-on-one chat mode, which
is just like normal chat mode (where messages with the type
chat are exchanged) except that the routing goes through the
component. The intended recipient of a conference-routed chat message
is identified by his room JID. So in this example:

RECV: <message to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/flash' type='chat'> <body>Is that you, qmacro?</body> <thread>jarl1998911094</thread> </message>

the user nicknamed flash actually addressed the chat message
to the JID:


which arrived at the Conferencing component (because of the hostname,
conf.merlix.dyndns.org causes the <message/>
element to be routed there), which then looked up internally who
deejay really was (qmacro@jabber.com/jarltk) and
sent it on. This way, the recipient of a conference-routed message
never discovers the real JID of the sender. In all other ways, the
actual <message/> element is like any other
<message/> element—in this case, it contains a message
<body/> and a chat <thread/>. (See
Section 5.4.1 for details on the <message/> element.)
Leaving the room
In the same way that room entrance is effected by sending an
available presence (remember, a <presence/>
element without an explicit type attribute is understood to
represent type="available'), leaving a room is achieved by
doing the opposite:

RECV: <presence to='qmacro@jabber.com/jarltk' type='unavailable' from='cellar@conf.merlix.dyndns.org/roscoe'/>

The people in the conference room are sent a message that
roscoe has left the room by the unavailable presence
packet. This is by and large for the benefit of each user's client, so
that the room occupant list can be updated. The component also sends
out a verbal notification, in the same way as it sends a verbal
notification out when someone joins:

RECV: <message to='qmacro@jabber.com/jarltk' type='groupchat' from='cellar@conf.merlix.dyndns.org'> <body>roscoe has left</body> </message>

Like the join notification, the text for the leave notification ("has
left") comes directly from the component instance configuration
described in Section 4.10.3.

The Script's Scope

The Keyword Assistant (keyassist) script will be written in Python using the Jabberpy library. As mentioned earlier, the script will perform the following tasks:

  • Connect to a predetermined Jabber server
  • Join a predetermined conference room
  • Sit there quietly, listening to the conversation
  • Take simple commands from people to watch for, or stop watching for, particular words or phrases uttered in the room
  • Relay the context of those words or phrases to whomever requested them, if heard In addition

to setting the identity of the Jabber server and the conference room in variables, we'll also need to keep track of which users ask the assistant for words and phrases.
We'll use a dictionary (hash in Perl terms), as shown in Example 9-3, because we want to manage the data in there by key, the JID of those users that the script will be assisting. Having a look at what this dictionary will look like during the lifetime of this script will help us to visualize what we're trying to achieve.

Typical contents of the Keyword Assistant's dictionary

{ 'dj@gnu.pipetree.com/home': { 'http:': 1, 'ftp:': 1

 'piers@jabber.org/work':                { 'Perl': 1, 'Java': 1, 'SAP
 R/3': 1
 'cellar@conf.merlix.dyndns.org/roscoe': { 'dialback': 1


We can see from the contents of the dictionary in Example 9-3 that three people have asked the script to look out for words and phrases. Two of those people—dj and piers—have interacted with the script directly by sending a normal (or chat) <message/>. The other person, with the conference nickname roscoe, is in the "cellar" room and has sent the script a message routed through the Conference component in the same way that flash sent qmacro a private message in Example 9-2: the JID of the sender belongs to (has the hostname set to) the conference component. Technically, there's nothing to distinguish the three JIDs here; it's just that we know from the name that conf.merlix.dyndns.org is the name that identifies such a component.

If we dissect the dictionary, we can see that:

  • dj wants to be notified if any web or FTP URLs are mentioned.
  • piers is interested in references to two of his favorite

languages and his favorite business software solution.

  • roscoe

is interested in any talk about dialback. We said we'd give the script a little bit of intelligence. This was a reference to the ability for users to interact with the script while it runs, rather than having to give the script a static list of words and phrases in a configuration file. dj, piers, and roscoe have done this by sending the script messages (directly, not within the room) with simple keyword commands, such as:

dj: "watch http:" script: "ok, watching for http:" dj:
"watch gopher:" script: "ok, watching for gopher:" dj:
"watch ftp:" script: "ok, watching for ftp:" dj: "ignore
gopher:" script: "ok, now ignoring gopher:"


piers: "list" script: "watching for: Perl, Java, SAP R/3"


roscoe: "stop" script: "ok, I've stopped watching"

The keyassist Script

Example 9-4 shows the keyassist script in its entirety. The script is described in detail, step by step, in the next section.

The keyassist Perl script

import jabber from string import split, join, find import sys

keywords = {}

def addword(jid, word): if not keywords.has_key(jid): keywords[jid] = {} keywords[jid][word] = 1

def delword(jid, word): if keywords.has_key(jid): if keywords[jid].has_key(word): del keywords[jid][word]

def messageCB(con, msg):

   type = msg.getType() if type == None: type = 'normal'
   # Deal with interaction
   if type == 'chat' or type == 'normal': jid = str(msg.getFrom())
       message = split(msg.getBody(), None, 1); reply = ""
       if message[0] == 'watch': addword(jid, message[1]) reply =
       "Okay, watching for " + message[1]
       if message[0] == 'ignore': delword(jid, message[1]) reply =
       "Okay, now ignoring " + message[1]
       if message[0] == 'list': if keywords.has_key(jid): reply =
       "Watching for: " + join(keywords[jid].keys(), ", ") else: reply
       = "Not watching for any keywords"
       if message[0] == 'stop': if keywords.has_key(jid): del
       keywords[jid] reply = "Okay, I've stopped watching"
       if reply: con.send(msg.build_reply(reply))

   # Scan room talk
   if type == 'groupchat': message = msg.getBody()
       for jid in keywords.keys(): for word in keywords[jid].keys(): if
       find(message, word) >= 0: con.send(jabber.Message(jid, word +
       ": " + message))

def presenceCB(con, prs):

   # Deal with nickname conflict in room
   if str(prs.getFrom()) == roomjid and prs.getType() == 'error':
   prsnode = prs.asNode() error = prsnode.getTag('error') if error: if
   (error.getAttr('code') == '409'): print "Cannot join room -
   conflicting nickname" con.disconnect() sys.exit(0)
   # Remove keyword list for groupchat correspondent
   if prs.getType() == 'unavailable': jid = str(prs.getFrom()) if
   keywords.has_key(jid): del keywords[jid]

Server = 'gnu.mine.nu' Username = 'kassist' Password = 'pass' Resource = 'py'

Room = 'jdev' ConfServ = 'conference.jabber.org' Nick = 'kassist'

con = jabber.Client(host=Server) try: con.connect() except IOError, e: print "Couldn't connect: %s" % e sys.exit(0) else: print "Connected"

if con.auth(Username,Password,Resource): print "Logged in as %s to server %s" % ( Username, Server ) else: print "Problems authenticating: ", con.lastErr, con.lastErrCode sys.exit(1)

con.setMessageHandler(messageCB) con.setPresenceHandler(presenceCB)


roomjid = Room + '@' + ConfServ + '/' + Nick print "Joining " + Room con.send(jabber.Presence(to=roomjid))

while(1): con.process(5)

Dissecting the keyassist Script

Taking keyassist step by step, the first section is probably familiar if you've seen the previous Python-based scripts in Section 8.1 and Section 8.3, both in Chapter 8.

import jabber from string import split, join, find import sys

Here, all of the functions and libraries that we'll need are brought in. We'll use the find function from the string library to help with the keyword searching.

Next, we declare the dictionary. This will hold a list of the words that the script will look for, as defined by each person, as shown in Example 9-3.

keywords = {}

Maintaining the keyword dictionary

To maintain this dictionary, we will use two subroutines to add words to and remove words from a user's word list. These subroutines are called when a command such as watch or ignore is recognized in the callback subroutine that handles incoming <message/> elements:

def addword(jid, word): if not keywords.has_key(jid): keywords[jid] = {} keywords[jid][word] = 1

def delword(jid, word): if keywords.has_key(jid): if keywords[jid].has_key(word): del keywords[jid][word]

A string representation of the JID (in jid) of the correspondent giving the command is passed to the subroutines along with the word or phrase specified (in word) by the user. The dictionary has two levels: the first level is keyed by the JID, and the second by word or phrase. We use a dictionary, rather than an array, at the second level simply to make removal of words and phrases easier.

Message callback

Next, we define the callback to handle incoming <message/> elements:

def messageCB(con, msg):

   type = msg.getType() if type == None: type = 'normal'

As usual, we're expecting the message callback to be passed the connection object (in con) and the message object itself (msg). How this callback is to proceed is determined by the type of message received. We determine the type (taken from the <message/> element's type attribute) and store it in the variable called type. Remember that if no type attribute is present, a message type of normal is assumed. (See Section for details of <message/> attributes.)

The two types of incoming messages we're expecting this script to receive are those conveying the room's conversation—in groupchat-type messages—and those over which the commands such as watch and ignore are carried, which we expect in the form of normal- or chat-type messages.

The first main section of the messageCB handler deals with incoming commands:

# Deal with interaction if type == 'chat' or type == 'normal': jid = str(msg.getFrom())

       message = split(msg.getBody(), None, 1); reply = ""
       if message[0] == 'watch': addword(jid, message[1]) reply =
       "Okay, watching for " + message[1]
       if message[0] == 'ignore': delword(jid, message[1]) reply =
       "Okay, now ignoring " + message[1]
       if message[0] == 'list': if keywords.has_key(jid): reply =
       "Watching for: " + join(keywords[jid].keys(), ", ") else: reply
       = "Not watching for any keywords"
       if message[0] == 'stop': if keywords.has_key(jid): del
       keywords[jid] reply = "Okay, I've stopped watching"
       if reply: con.send(msg.build_reply(reply))

If the <message/> element turns out to be of the type in which we're expecting a potential command, we want to determine the JID of the correspondent who sent that message. Calling the getFrom() method will return us a JID object. What we need is the string representation of that, which can be determined by calling the str() function on that JID object:

jid = str(msg.getFrom())

Then we grab the content of the message by calling the getBody() on the msg object and split the whole thing on the first bit of whitespace. This should be enough for us to distinguish a command (watch, ignore, and so on) from the keywords. After the split, the first element (index 0) in the message array will be the command, and the second element (index 1) will be the word or phrase, if given. At this stage, we also declare an empty reply:

message = split(msg.getBody(), None, 1); reply = ""

Now it's time to determine if what the script was sent made sense as a command:

if message[0] == 'watch': addword(jid, message[1]) reply = "Okay, watching for " + message[1]

       if message[0] == 'ignore': delword(jid, message[1]) reply =
       "Okay, now ignoring " + message[1]
       if message[0] == 'list': if keywords.has_key(jid): reply =
       "Watching for: " + join(keywords[jid].keys(), ", ") else: reply
       = "Not watching for any keywords"
       if message[0] == 'stop': if keywords.has_key(jid): del
       keywords[jid] reply = "Okay, I've stopped watching"

We go through a series of checks, taking appropriate action for the supported commands:

Watch for a particular word or phrase.
Stop watching for a particular word or phrase.
List the words and phrases currently being watched.
Stop watching altogether; remove the list of words and phrases. The
addword() and delword() functions defined earlier
are used here, as well as other simpler functions; one that lists the
words and phrases for a particular JID:


and one that removes them:

del keywords[jid]

If there was something recognizable for the script to do, we get it to reply appropriately:

if reply: con.send(msg.build_reply(reply))

The build_reply() function creates a reply out of a message object by setting to to the value of the original <message/> element's from attribute and preserving the element type attribute and <thread/> tag, if present. The <body/> of the reply object (which is just a <message/> element) is set to whatever is passed in the function call; in this case, it's the text in the reply variable.

Now that we've dealt with incoming commands, we need another section in the message callback subroutine to scan for the words and phrases. The target texts for this scanning will be the snippets of room conversation, which arrive at the callback in the form of groupchat-type <message/> elements:

# scan room talk if type == 'groupchat': message = msg.getBody()

The message variable holds the string we need to scan; it's just a case of checking for each of the words or phrases on behalf of each of the users who have asked:

for jid in keywords.keys(): for word in keywords[jid].keys(): if find(message, word) >= 0: con.send(jabber.Message(jid, word + ": " + message))

If we get a hit, we construct a new Message object, passing the JID of the person for whom the string has matched (in the jid variable) and the notification, consisting of the word or phrase that was found (in word) and the context in which it was found (the sentence uttered, in message). Once found and constructed, the <message/> is sent to that user. By default, the Message constructor specifies no type attribute, so the user is sent a "normal" message.

Presence callback

Having dealt with the incoming <message/> elements, we turn to the <presence/> elements. Most of those we receive in this conference room will be notifications from people entering and leaving the room, as shown in Example 9-2. We want to perform housekeeping on our keywords dictionary so the entries don't become stale. We also want to deal with the potential problem of conflicting nicknames. Let's look at that first.

We want to check for the possibility of nickname conflict problems that may occur when we enter the room, and the chosen nickname (flash) is already taken.

Remembering that a conflict notification will look something like this:

<presence to='qmacro@jabber.com/jarltk' from='cellar@conf.merlix.dyndns.org/flash' type='error'> <error code='409'>Conflict</error> </presence> we test for the receipt of a <presence/> element with the following:

def presenceCB(con, prs):

   # Deal with nickname conflict in room
   if str(prs.getFrom()) == roomjid and prs.getType() == 'error':
   prsnode = prs.asNode() error = prsnode.getTag('error') if error: if
   (error.getAttr('code') == '409'): print "Cannot join room -
   conflicting nickname" con.disconnect() sys.exit(0)

The <presence/> element will appear to be sent from the JID that we constructed for the initial room entry negotiation (in the roomjid variable further down in the script); for example:

jdev@conference.jabber.org/kassist We compare this value to the value of the incoming <presence/>'s from attribute, and also make sure that the type attribute is set to error. If it is, we want to extract the details from the <error/> tag that will be contained as a direct child of the <presence/>.

The Jabberpy library currently doesn't offer a direct high-level function to get at this tag from the Presence object (in prs), but we can strip away the presence object "mantle" and get at the underlying object, which is a neutral "node"—a Jabber element, or XML fragment, without any preconceived ideas of what it is (and therefore without any accompanying high-level methods such as getBody() or setPriority()).

{{Note|If this seems a little cryptic, just think of it like this: each of the Presence, Message, and IQ classes are merely superclasses of the base class Protocol, which represents elements generically.

</code> The asNode() method gives us what we need—a Protocol object representation of the <presence/> element. From this we can get to the <error/> tag and its contents. If we find that we do have a nickname conflict, we abort by disconnecting from the Jabber server and ending the script.

The general idea is that this script will run indefinitely and notify the users on a continuous basis, so we need to do a spot of keyword housekeeping. No presence subscription relationships are built (mostly to keep the script small and simple; you could adapt the mechanism from the recipe in Section 8.3 if you wanted to make this script sensitive to presence), so notifications will get queued up for the user if he is offline with the use of the mod_offline module of the Jabber Session Manager (JSM). This makes a lot of sense for the most part; however, we still want to have the script send notifications even if the user is offline. Additionally, a command could be sent to the script to watch for a keyword or phrase from a user within the room. We would receive the command from a JID like this:

jdev@conference.jabber.org/nickname This is a transient JID, in that it represents a user's presence in the jdev room for a particular session. If a word is spotted by the script hours or days later, there's a good chance that the user has left the room, making the JID invalid as a recipient. Although the JID is technically valid and will reach the conferencing component, there will be no real user JID that it is paired up with. Potentially worse, the room occupant's identity JID may be assigned to someone else at a later stage, if the original user left, and a new user entered choosing the same nickname the original user had chosen. See the upcoming sidebar titled "Transient and Nonexistent JIDs" for a short discussion of the difference between a transient JID and a nonexistent JID.

So as soon as we notice a user leave the room we're in, which will be indicated through a <presence/> element conveying that occupant's unavailability, we should remove any watched-for words and phrases from the dictionary:

# Remove keyword list for groupchat correspondent if prs.getType() == 'unavailable': jid = str(prs.getFrom()) if keywords.has_key(jid): del keywords[jid]

As before, we obtain the string representation of the JID using the str() function on the JID object that represents the presence element's sender, obtained via the getFrom() method.

{{Sidebar|Transient and Nonexistent JIDsWhat happens when you send a message to a "transient" conference room JID? Superficially, the same as when you send one to a nonexistent JID. But there are some subtle differences.

A transient JID is one that reflects a user's alternate identity in the context of the Conferencing component. When you construct and send a message to a conference transient JID, it goes first to the conference component because of the hostname in the JID that identifies that component, for example:

jdev@conference.jabber.org/qmacro The hostname conference.jabber.org is what the jabberd backbone uses to route the element. As mentioned earlier, the Conferencing component will relay a message to the real JID that belongs to the user currently in a room hosted by that component.

While the component itself is usually persistent, the room occupants (and so their transient JIDs) are not. When a message is sent to the JID jdev@conference.jabber.org/qmacro and there is no room occupant in the jdev room with the nickname qmacro, the message will still reach its first destination—the component—but be rejected at that stage, as shown in Example 9-5.

A message to a nonexistent transient JID is rejected

SEND: <message to='jdev@conference.jabber.org/qmacro'> <body>Hello there</body> </message>

RECV: <message to='dj@gnu.mine.nu/jarl' from='jdev@conference.jabber.org/qmacro' type='error'> <body>Hello there</body> <error code='404'>Not Found</error> </message>

Although the rejection—the "Not Found" error—is the same as if a message had been sent to a JSM user that didn't exist, the difference is that the transient user always had the potential to exist, whereas the JSM user never did. Of course, if the JID referred to a nonexistent Jabber server, then the error returned wouldn't be a "Not Found" error 404, but an "Unable to resolve hostname" error 502.


The main script

Now that we have the subroutines and callbacks set up, all we need to do is define the Jabber server and room information:

Server = 'gnu.mine.nu' Username = 'kassist' Password = 'pass' Resource = 'py'

Room = 'jdev' ConfServ = 'conference.jabber.org' Nick = 'kassist'

The kassist user can be set up simply by using the reguser script presented in Section 7.4:

$ ./reguser gnu.mine.nu username=kassist password=pass [Attempt] (kassist) Successful registration $ In the same way as in previous recipes' scripts, a connection attempt is made, followed by an authentication attempt:

con = jabber.Client(host=Server,debug=0,log=0) try: con.connect() except IOError, e: print "Couldn't connect: %s" % e sys.exit(0) else: print "Connected"

if con.auth(Username,Password,Resource): print "Logged in as %s to server %s" % ( Username, Server ) else: print "Problems authenticating: ", con.lastErr, con.lastErrCode sys.exit(1)

Then the message and presence callbacks messageCB() and presenceCB() are defined to the connection object in con:

con.setMessageHandler(messageCB) con.setPresenceHandler(presenceCB)

After sending initial presence, informing the JSM (and anyone who might be subscribed to kassist's presence) of the assistant's availability:


we also construct—from the Room, ConfServ, and Nick variables—and send the <presence/> element for negotiating entry to the jdev room hosted by the Conferencing component at conference.jabber.org:

roomjid = Room + '@' + ConfServ + '/' + Nick print "Joining " + Room con.send(jabber.Presence(to=roomjid))

The con.send() function will send a <presence/> element that looks like this:

SEND: <presence to='jdev@conference.jabber.org/kassist'/> We're sending available presence to the room, to negotiate entry, but what about the initial presence? Why do we send that too if there are no users who will be subscribed to the kassist JID? If no initial presence is sent, the JSM will merely store up any <message/> elements destined for kassist, as it will think the JID is offline.

The processing loop

Once everything has been set up, we simply need to have the script sit back and wait for incoming packets and handle them appropriately. For this, we simply call the process() function every 5 seconds to look for elements arriving on the XML stream:

while(1): con.process(5)

Connecting Devices to Jabber

LEGO MINDSTORMS. What a great reason to dig out that box of LEGO bricks you haven't touched in years. When I found out that LEGO was bringing out a programmable brick, the RCX,[2], I went to my favorite toy shop and purchased the set. In addition to the RCX (shown in Figure 9-1), the MINDSTORMS set comes with an infrared (IR) port and an IR tower, which you can connect to the serial port of your PC, a battery compartment,[3] motors, touch and light sensors, and various LEGO Technic parts.

The LEGO MINDSTORMS RCX, or "programmable brick"

</code> There are plenty of ways to interact with the RCX. The MINDSTORMS Robotics Invention System (RIS)™ set comes with Windows software with which you can build programs by moving blocks of logic around graphically on the screen and chaining them together. In addition, various efforts on the parts of talented individuals have come up with many different ways to program the RCX. The Unofficial Guide to LEGO® MINDSTORMS™ Robots (O'Reilly & Associates, Inc., 1999) tells you all you need to know about programing the RCX. What's important to know for this recipe is detailed in Programming the RCX.

{{Sidebar|Programming the RCXThere are two approaches to programming the RCX. One approach is to write a program on your PC, download it to the RCX, and start and stop the program using the buttons on the RCX itself.

The other approach is to control the RCX directly from a program that you write and execute on your PC, sending control signals and receiving sensor values over the IR connection.

Both approaches have their merits. How appropriate each one is boils down to one thing: connections. On the one hand, building autonomous machines that find their way around the kitchen to scare the cat and bring you a sandwich calls for the first approach, when, once you've downloaded the program to the RCX, you can dispense with any further connection with your PC because the entire logic is situated in your creation. On the other hand, if you want to build a physical extension to a larger system that, for example, has a connection to the Internet, the second approach is likely to be more fruitful, because you can essentially use the program that runs on your PC and talks to the RCX over the IR link as a conduit, a proxy of sorts, to other programs and systems that can be reached over the network. We're going to use the second approach.

The RIS software that comes as standard centers around an ActiveX control. While there are plenty of ways to talk to the RCX without using this control (the book mentioned earlier describes many of these ways), the features offered by the control—Spirit.ocx—are fine for many a project. And with Perl's Win32::OLE module, we can interact with this ActiveX control without having to resort to Visual Basic.


What We're Going to Do

Everyone knows that one of the virtues of a programmer is laziness. We're going to extend this virtue (perhaps a little too far) and enhance it with a hacker's innate ability to combine two favorite pastimes—programming and playing with LEGO—to build contrived but fun devices.

Often being a key part of a programmer's intake, coffee figures highly on the daily agenda. It's important to have a good cup of coffee to keep the brain cells firing, but it's even more important to know whether there's actually any coffee left in the pot. Going over to the coffeepot to find out is time away from the keyboard and therefore time wasted. So let's put the RCX to good use and build a device to tell us, via Jabber, whether the coffeepot has enough for another cup.

In building the device, a light sensor was connected to the RCX to "see" the level of coffee in the pot. Since the coffeepot is made of glass, light passes through it unless the coffee gets in the way, thus creating a simple binary switch:

  • No (or a small amount of) light measured: there's coffee in the pot.
  • Some (or a larger amount of) light: there's no coffee in the pot. We

want to be able to send the availability of coffee to all interested parties in a way that their off-the-shelf Jabber clients can easily understand and display.

Figure 9-2 shows the LEGO MINDSTORMS device in action. The brick mounted on the gantry is the light sensor, which extends to the glass coffeepot; a wire runs from it to the connector on the RCX. Behind the RCX is the IR tower, which is connected to the PC.

Our device "looking" at the coffeepot

</code> Remembering that <presence/> elements are a simple way of broadcasting information about availability and that they contain a <status/> tag to describe the detail or context of that availability (see Section 5.4.2 for details on the <presence/> element), we have a perfect mechanism that's ready to be used. What's more, most, if not all, of the off-the-shelf Jabber client implementations will display the content of the <status/> tag in the client user's roster next to the JID to which it applies. Figure 9-3 shows how the content of the <status/> tag is displayed as a hovering "tooltip" in WinJab.

Receiving information on the coffee's status in WinJab

</code> Here's what we need to do:

Step 1
Set up the RCX
We need to set the RCX up, with the light sensor, so that it's close
enough to the coffeepot to take reliable and consistent light
readings. Luckily the serial cable that comes with the MINDSTORMS set
and connects to the IR tower is long enough to stretch from the
computer to within the infrared line of sight to the RCX.
Step 2
Make the correct calibrations
There are bound to be differences in ambient light, sensitivity of the
light sensor, and how strong you make your coffee. So we need a way of
calibrating the setup, so that we can find the appropriate "pivot
point" light reading value that lies between the two states of
coffee and no coffee.
Step 3
Set up a connection to Jabber
We need a connection to a Jabber server and a client account there. We
can set one up using the reguser script from Section 7.4. We also need
the script to honor presence from users who want to be informed of the
coffee state.
Step 4
Set up a sensor poll/presence push loop
Once the RCX has been set up, the sensor calibrations taken, and the
connection has been made, we need to monitor the light sensor on the
RCX at regular intervals. At each interval, we determine the coffee
state by comparing the value received from the sensor with the pivot
point determined in the calibration step and send any change in that
state as a new availability <presence/> element
containing an appropriate description in the <status/>

The Coffee Script

We're going to use Perl and the Net::Jabber libraries to build the script shown in Example 9-6. Perl allows us a comfortable way to interact with an ActiveX control, through the Win32::OLE module, so let's have a look at the coffee script as a whole, then we'll go back and look at the script in detail.

The coffee script, written in Perl

use Net::Jabber qw(Client); use Win32::OLE; use Getopt::Std; use strict;

my %opts; getopt('ls', \%opts);

use constant SERVER => "merlix.dyndns.org"; use constant PORT => 5222; use constant USERNAME => "coffee"; use constant PASSWORD => "pass"; use constant RESOURCE => "perlscript";

use constant NOCOFFEE => 0; use constant COFFEE => 1;

use constant SENSOR => defined($opts{'s'}) ? $opts{'s'} : 0; use constant GRAIN => 1;

my $current_status = -1; my @status; $status[NOCOFFEE] = 'xa/Coffeepot is empty'; $status[COFFEE] = '/Coffee is available!';

my $rcx = &setup_RCX(SENSOR);

  1. Either calibrate if no parameters given, or run with the parameter
  2. given as -l, which will be taken as the pivot between coffee and no
  3. coffee

&calibrate($rcx) unless defined($opts{'l'});

  1. Determine initial status (will be either 0 or 1)

my $s = &set_status($rcx->Poll(9, SENSOR));

my $jabber = &setup_Jabber(SERVER, PORT, USERNAME, PASSWORD, RESOURCE, $s);

  1. Main loop: check Jabber and RCX

while (1) { defined($jabber->Process(GRAIN)) or die "The connection to the Jabber server was broken\n"; my $s = &set_status($rcx->Poll(9, SENSOR)); &set_presence($jabber, $s) if defined $s; }

  1. Set up Jabber client connection, sending initial presence

sub setup_Jabber { my ($server, $port, $user, $pass, $resource, $initial_status) = @_; my $connection = new Net::Jabber::Client;

 # Connect
 my $status = $connection->Connect( hostname => $server, port    
 => $port ); die "Cannot connect to Jabber server $server on port
 $port\n" unless $status;
 # Callbacks
 $connection->SetCallBacks( presence => \&InPresence );
 # Ident/Auth
 my @result = $connection->AuthSend( username => $user, password
 => $pass, resource => $resource ); die "Ident/Auth failed:
 $result[0] - $result[1]\n" if $result[0] ne "ok";
 # Roster
 # Initial presence dependent upon initial status
 &set_presence($connection, $initial_status);
 return $connection;


sub set_presence { my ($connection, $s) = @_; my $presence = Net::Jabber::Presence->new(); my ($show, $status) = split("/", $status[$s], 2); $presence->SetPresence( show => $show, status => $status ); print $status, "\n"; $connection->Send($presence); }

  1. Handle presence messages

sub InPresence { my $presence = $_[1]; my $from = $presence->GetFrom(); my $type = $presence->GetType();

 if ($type eq "subscribe") { print "Subscribe request ($from) ...\n";
 $jabber->Send($presence->Reply(type => 'subscribed'));
 if ($type eq "unsubscribe") { print "Unsubscribe request ($from)
 ...\n"; $jabber->Send($presence->Reply(type =>


sub setup_RCX { my $sensor = shift; my $rcx = Win32::OLE->new('SPIRIT.SpiritCtrl.1'); $Win32::OLE::Warn = 0; $rcx->{ComPortNo} = 1; $rcx->{InitComm}; $rcx->SetSensorType($sensor, 3); $rcx->SetSensorMode($sensor, 2); return $rcx; }

sub calibrate { my $rcx = shift;

 print <<EOT; Calibration mode. Note the sensor values and decide
 on a 'pivot' value above which 'no coffee' is signified and below
 which 'coffee' is signified.

End the calibration mode with Ctrl-C.

Press Enter to start calibration... EOT

 while (1) { print $rcx->Poll(9, SENSOR), " "; sleep 1;


sub set_status { my $val = shift;

 my $new_status = $val < $opts{'l'} ? COFFEE : NOCOFFEE;
 if ($new_status != $current_status) { $current_status = $new_status;
 return $current_status;
 else { return undef;


Examining the Coffee Script Step by Step

Now that we've seen the coffee script as a whole, let's examine it step by step to see how it works.

Declaring the modules, constants, and variables

We first declare the packages we're going to use. In addition to Net::Jabber and Win32::OLE, we're going to use Getopt::Std, which affords us a comfortable way of accepting and parsing command-line options. We also want to use the strict pragma, which should keep us from making silly coding mistakes by not allowing undeclared variables and the like.

We specify Client on the usage declaration for the Net::Jabber package to specify what should be loaded. The package is a large and comprehensive set of modules, and only some of those are relevant for what we wish to do in the script—build and work with a Jabber client connection. Other module sets are pulled in by specifying Component or Server.

use Net::Jabber qw(Client); use Win32::OLE; use Getopt::Std; use strict;

We're going to allow the command-line options -l and -s, which perform the following tasks:

No options specified (or just the -s options)
calibration mode.
When we run the script for the first time, we need to perform the
calibration and read values from the sensor to determine a midpoint
value. A number above the midpoint signifies the presence of light
and therefore the absence of coffee; below signifies the absence of
light and therefore the presence of coffee. This step is necessary
because not every environment (ambient light, sensitivity of the light
sensor, and so on) will be the same. The upper and lower values,
representing lightness and darkness, respectively, will vary across
different environments. The point is to obtain a value in between
these upper and lower values—the midpoint—with which we can compare a
light value read at any particular time. : If we don't specify any
options, the script will start up automatically in calibration mode:

C:\temp> perl coffee.pl

Figure 9-4 shows the script run in calibration mode. The values
displayed, one each second, represent the values read from the light
sensor. When the sensor was picking up lots of light, the values were
60. When the sensor was moved in front of some coffee, the values went
down to around 45. Based upon this small test, the pivot point value
was 50, somewhere in between those two values.
Specify the pivot value.
Once we've determined a pivot point value, we run the script and tell
it this pivot value with the -l (light pivot):

C:\temp> perl coffee.pl -l 50

Specify the sensor number.
The RCX, shown in Figure 9-1, has three connectors to which you can
attach sensors. They're the three gray 2-by-2 pieces, labeled 1, 2,
and 3, near the top of the brick. The script assumes you've attached
the light sensor to the one marked 1, which internally is 0. If you
attach it to either of the other two, you can specify the connector
using the -s (sensor) with a value of 1 (for the middle connector) or
2 (for the rightmost connector), like this:

C:\temp> perl coffee.pl -l 50 -s 2

You can specify the -s option when running in calibration or normal

{{Figure|title=Running coffee in calibration mode|image=0596002025-jab_0904.png</code> The options, summarized in Table 9-1, are defined with the Getopt::Std function:

my %opts; getopt('ls', \%opts);

Summary of the startup options -
Option Meaning
No option Start the script automatically in calibration mode.
-l Specify the pivot value.
-s Specify the sensor number.

Next comes a list of constants, which describe:

  • The script's Jabber relationship, including the server it will connect

to and the username, password, and resource it will connect with.

  • The representation of the two states of coffee and no coffee, which

will be used to determine the content of the <status/> tag sent along inside any <presence/> element emitted.

  • The identification of the connector to which the light sensor is

attached and the polling granularity of the sensor (poll/presence push loop) described earlier. This item is measured in seconds.

use constant SERVER => "merlix.dyndns.org"; use constant PORT => 5222; use constant USERNAME => "coffee"; use constant PASSWORD => "pass"; use constant RESOURCE => "perlscript";

use constant NOCOFFEE => 0; use constant COFFEE => 1;

use constant SENSOR => defined($opts{'s'}) ? $opts{'s'} : 0; use constant GRAIN => 1;

The last part of the script's setup deals with the coffee state:

my $current_status = -1; my @status; $status[NOCOFFEE] = 'xa/Coffeepot is empty'; $status[COFFEE] = '/Coffee is available!';

We use a two-element array (@status) to represent the two possible coffee states. The value of each array element is a two-part string, with each part separated by a slash (/). Each of these parts will be transmitted in a <presence/> element, with the first part (which is empty in the element representing the COFFEE state) representing the presence <show/> value and the second part representing the presence <status/> value. Example 9-7 shows what a <presence/> element looks like when built up with values to represent the NOCOFFEE state.

A presence element representing the NOCOFFEE state

<presence> <show>xa</show> <status>Coffeepot is empty</status> </presence>

Most Jabber clients use different icons in the roster to represent different <show/> values. In this case, we will use xa for no coffee and a blank (which represents "online" or "available") for coffee to trigger the icon change.

Initialization and calibration

Whenever we need to talk to the RCX, some initialization is required via the ActiveX control. That's the same whether we're going to calibrate or poll for values. The setup_RCX() function takes a single argument—the identification of which connector the light sensor is connected to—and performs the initialization, which is described later in Section The function returns a handle on the Win32::OLE object that represents the ActiveX control, which in turn represents the RCX via the IR tower:

my $rcx = &setup_RCX(SENSOR);

If the -l option is not specified, it means we're going to be running calibration. So we call the calibrate() function to do this for us. We pass the RCX handle (in $rcx) so the calibration can run properly:

# Either calibrate if no parameters given, or

  1. run with the parameter given as -l, which will be taken as the pivot
  2. between coffee and no coffee

&calibrate($rcx) unless defined($opts{'l'});

As with the setup_RCX() function, calibrate() is described later.

Calibration mode will be terminated by ending the script with Ctrl-C, so the next thing we come across is the call to the function set_status(), which represents the first stage in the normal script mode; set_status() is used to determine the initial coffee status.

A value is retrieved by calling the ActiveX control's Poll() function. (Table 9-2 lists the ActiveX control's functions and properties used in this script.) We specify that we're after a sensor value (the 9 as the first argument) from the sensor attached to the connector indicated by the SENSOR constant:

# Determine initial status (will be either 0 or 1) my $s = &set_status($rcx->Poll(9, SENSOR));

The value retrieved is passed to the set_status() function, which determines whether the value is above or below the pivot value and whether the new status is different from the current one. It's going to be something along the lines of one of the values displayed when the script was run in calibration mode. If it is (and in this case, it will be, because in this first call, the value of $current_status is set to -1, which represents neither the COFFEE nor the NOCOFFEE state), that status will be returned; otherwise, undef will be returned.

RCX Spirit.ocx ActiveX control properties and functions used -
Function/Property Description
Poll(SOURCE, NUMBER) Retrieves information from the RCX. In this script, the value for the SOURCE argument is always 9, which represents a sensor value (i.e., a value measured at a sensor), as opposed to an internal RCX variable or a timer. The NUMBER argument represents the connector to which the sensor we want to read is attached.
SetSensorMode(NUMBER, MODE [, SLOPE]) This function returns a value from the sensor. As with Poll() and SetSensorType(), NUMBER represents the sensor connector. The MODE argument can be used to determine the sensor mode, which can be Raw (mode 0), Boolean (mode 1), Transitional (mode 2), Periodic (mode 3), Percentage (mode 4), Celcius (mode 5), Farenheit (mode 6), or Angle (mode 7). The SLOPE argument qualifies the Boolean mode by specifying how True and False are to be determined.
SetSensorType(NUMBER, TYPE) This function is used to specify the type of sensor the values will be read from. The NUMBER argument is the same as for the Poll() and represents the sensor connector. The TYPE argument represents the type of sensor that you want to set: None (type 0), Switch (type 1), Temperature (type 2), Light (type 3), or Angle (type 4).
property:ComPortNo The serial port to which the IR tower is connected (e.g., 1 = COM1, 2 = COM2, and so on).
property:InitComm When invoked, the serial communication port is initialized in preparation for the IR connection to the RCX.

==== Connecting to the Jabber server ====
At this stage, we're ready to connect to the Jabber server. The call to setup_Jabber() does this for us, returning a handle to the Jabber connection object that we store in $jabber. This handle will be used later in the script to send out <presence/> elements. The $jabber variable contains a reference to a Net::Jabber::Client object. This is the equivalent of the con variable used in the earlier Python scripts to hold the jabber.Client object and the ConnectionBean object (cb) in the earlier Java script.[4]

my $jabber = &setup_Jabber(SERVER, PORT, USERNAME, PASSWORD, RESOURCE, $s);

In addition to passing the constants needed for the client connection to the Jabber server, we pass the initial coffee status, held in $s. We'll have a look at what the setup_Jabber() function does with this initial status a bit later when we get to the function's definition.

Sensor poll/presence push loop

Now that we've set everything up, determined the initial coffee status, and connected to the Jabber server, we're ready to start the main loop:

# Main loop: check Jabber and RCX while (1) { defined($jabber->Process(GRAIN)) or die "The connection to the Jabber server was broken\n"; my $s = &set_status($rcx->Poll(9, SENSOR)); &set_presence($jabber, $s) if defined $s; }

The while (1) loop is a bit of a giveaway. This script won't stop until you force it to by entering Ctrl-C—but that's essentially what we want. In the loop, we call the Process() method on the Jabber connection object in $jabber.

Process() is the equivalent of the Jabberpy's process() method in the Python scripts. Process() waits around for up to the number of seconds specified as the single argument (or not at all if no argument is specified) for XML to appear on the stream connection from the Jabber server. If complete fragments do appear, callbacks, defined in the connection object, are called with the elements (<iq/>, <message/>, and <presence/>) that the fragments represent. This is in the same way as, for example, callbacks are used in the Python scripts using the Jabberpy library. The setup_Jabber(), which will be discussed in the next section, is where the callback definition is made.

Net::Jabber's Process() method returns undef if the connection to the Jabber server is terminated while waiting for XML. The undef value is dealt with appropriately by ending the script.

The GRAIN constant, set to 1 second in the script's setup section, is used to specify how long to wait for any packets from the Jabber server. For the most part, we're not expecting to receive much incoming Jabber traffic—the occasional presence subscription (or unsubscription) request perhaps (see later), but other than that, the only packets traveling over the connection to the Jabber server will be availability <presence/> packets representing coffee state changes, sent from the script. This delay is normally set to 1 second. And because that's a comfortable polling interval for the light sensor, we can set that within the same loop.

Calling the ActiveX control's Poll() again with the same arguments as before ("get a sensor value from the sensor attached to the SENSORth connector"), we pass the value to the set_status() to determine the coffee state. If the state was different from last time (if $s receives a value and not undef), then we want to emit a <presence/> element to reflect that state. We achieve this by calling the set_presence() function, passing it the connection object and the state.

The setup_Jabber() function

Here we define the setup_Jabber() function, which is called to set up the connection to the Jabber server and authenticate with a predefined user:

# Set up Jabber client connection, sending intial presence sub setup_Jabber { my ($server, $port, $user, $pass, $resource, $initial_status) = @_; my $connection = new Net::Jabber::Client;

 # Connect
 my $status = $connection->Connect( hostname => $server, port    
 => $port ); die "Cannot connect to Jabber server $server on port
 $port\n" unless $status;
 # Callbacks
 $connection->SetCallBacks( presence => \&InPresence );
 # Ident/Auth
 my @result = $connection->AuthSend( username => $user, password
 => $pass, resource => $resource ); die "Ident/Auth failed:
 $result[0] - $result[1]\n" if $result[0] ne "ok";
 # Roster
 # Initial presence dependent upon initial status
 &set_presence($connection, $initial_status);
 return $connection;


First, we instantiate a new Net::Jabber::Client object. Net::Jabber distinguishes between client- and component-based connections to Jabber; the component-based equivalent of this class is Net::Jabber::Component. The Connect() method is passed arguments that specify the hostname and port of the Jabber server to connect to. It returns a 0 status if the connection could not be made.

We can register handlers for Jabber elements received over the XML stream carried by the connection we just made. Here we are interested in incoming presence subscription or unsubscription requests, as we'll see in the definition of the InPresence() function.

The single method, SetCallBacks(), does what the collective jabber.Client methods setPresenceHandler(), setMessageHandler(), and setIqHandler() do in a single call—taking a list of element types and subroutine references, in the form of a hash.

After registering the callback for <presence/> elements, it's time to authenticate, passing the username, password, and resource defined in the list of constants at the start of the script. If authentication is successful, the result of the call to the AuthSend() method is a single string with the value ok. If not, that value is replaced with an error code and the descriptive text is available in a further string. (This is why we catch the results of a call in an array, called @result.) A complete list of Jabber error codes and texts can be found in Table 5-3.

Why RosterGet()? We're not subscribing to anyone, and we're not really interested in anything but the values we're polling from our brick. So theoretically there's no reason to make a request to retrieve our roster from the server. However, because we want the script to receive and process subscription and unsubscription requests, we need to request the roster beforehand; otherwise, the JSM won't send such requests to us. See Section in Chapter 8 for an explanation as to why.

Once we've requested the roster, so as to receive presence subscription and unsubscription requests, the job is almost done. The last thing to do in setting up the Jabber connection is to send initial availability information. The setup_Jabber() function receives the initial coffee status as the last argument in the call (in $initial_status), which it passes on to the function that sends a <presence/> element, set_presence(). Along with the initial coffee status, we also send the $connection object that represents the connection to the Jabber server that we've just established (referred to outside of this function with the $jabber variable). This is so the set_presence() function can use the connection handle to send the element down the stream.

The set_presence() function

This function is used by setup_Jabber() to send the script's (and therefore the coffee's) initial presence. It's also used within the main sensor poll/presence push loop to send further presence packets if the coffee's state changes.

sub set_presence { my ($connection, $s) = @_; my $presence = Net::Jabber::Presence->new(); my ($show, $status) = split("/", $status[$s], 2); $presence->SetPresence( show => $show, status => $status ); print $status, "\n"; $connection->Send($presence); }

On receipt of the Jabber connection object and the coffee status, which will be 0 (NOCOFFEE) or 1 (COFFEE), set_presence() constructs a new Net::Jabber::Presence object. This object represents a <presence/> element, upon which we can make method calls to hone the element as we wish. SetPresence() is one of these methods, with which we can set values for each of the <show/> and <status/> tags. We retrieve the values for each of these tags by pulling the strings from the appropriate member of the @status array, as described earlier in Section

We print the coffee's status (remember, this function is called only when the status changes, not every time the sensor is polled) and send the newly built <presence/> element down the XML stream to the Jabber server. This is accomplished by passing the presence object as an argument to the Send() method of the connection object in $connection. This works in the same way as the send() function in Jabberpy and the send() function in JabberBeans. Everyone who has subscribed to the script user's presence, and who is available, will receive the coffee status information.

Figure 9-3 shows the status information received in the WinJab client. The string sent in the <status/> tag is shown in the tooltip that appears when the mouse hovers over the "coffee" roster item.

The InPresence() subroutine

Our presence handler, the callback subroutine InPresence(), honors requests for subscription and unsubscription to the script user's (and therefore the coffee's) presence. This callback is designed to work in the same way as the presenceCB() callback in the Python recipe described in Section 8.3.

However, while the Python Jabberpy library hands to the callbacks a jabber.Client object and the element to be handled, the Perl Net::Jabber library hands over a session ID and the element to be handled. Don't worry about the session ID here; it's related to functionality for building Jabber servers, not clients, and we can and should ignore it for the purposes of this recipe. What is important is the element to be handled, which appears as the second argument passed to the subroutine collected by the $presence variable from $_[1].

What is common between the two libraries is that the element that is passed to be handled as the subject of the callback is an instance of the class that the callback represents. In other words, a callback is used to handle <presence/> elements, and the element received is an instance of the Net::Jabber::Presence class (just as the element received by a Jabberpy presence callback is an instance of the jabber.Presence class).

# Handle presence messages sub InPresence { my $presence = $_[1]; my $from = $presence->GetFrom(); my $type = $presence->GetType();

 if ($type eq "subscribe") { print "Subscribe request ($from) ...\n";
 $jabber->Send($presence->Reply(type => 'subscribed'));
 if ($type eq "unsubscribe") { print "Unsubscribe request ($from)
 ...\n"; $jabber->Send($presence->Reply(type =>


With an object in $presence, we can get information from the element using data retrieval methods such as those used here: GetFrom() and GetType(), which extract the values from the from and type attributes of the <presence/> element, respectively.

If the <presence/> element type represents a subscription request (type="subscribe'), we unquestioningly honor the request, by sending back an affirmative reply. The Reply() method of the presence object is one of a number of high-level functions that make it possible to turn elements around and send them back. In this case, the method replaces the value of the <presence/>'s to attribute with the value of the from attribute, and preserves its id. It also allows us to pass arguments as if we were calling the SetPresence() method described earlier. Rather than set the <show/> and <status/> tags as we did earlier in the set_presence() function, we merely set the element's type attribute to subscribed or unsubscribed, depending on the request.

So, with an incoming <presence/> element in $presence that looks like this:

<presence from='qmacro@jabber.org/office' type='subscribe' to='coffee@merlix.dyndns.org' id='21'>

calling the Reply() method would cause the element in $presence to change to this:

<presence to='qmacro@jabber.org/office' type='subscribed' id='21'>

Remember, the from attribute on elements originating from the client is set by the server, not by the client. The script doesn't ask for a subscription to the user's presence in return. The script isn't interested in whether the people who have subscribed to its presence are available—its purpose is to let people know whether there's any coffee left in the pot.

The setup_RCX() function

This function is called once every time the script is started and is required to initialize the RCX:

sub setup_RCX { my $sensor = shift; my $rcx = Win32::OLE->new('SPIRIT.SpiritCtrl.1'); $Win32::OLE::Warn = 0; $rcx->{ComPortNo} = 1; $rcx->{InitComm}; $rcx->SetSensorType($sensor, 3); $rcx->SetSensorMode($sensor, 2); return $rcx; }

A Win32::OLE object representing the RCX's ActiveX control Spirit is instantiated. A Win32::OLE function is used to suppress warnings, and the RCX is initialized by setting the COM port to COM1 for serial communications. The sensor type and mode are set for the light sensor attached to the connector identified by the value passed into the $sensor variable. Table 9-2 shows us that sensor type 3 represents Light, and sensor mode 2 specifies a Transitional measurement mode, the upshot of which is that the values returned on a poll are all within a certain restricted range, which makes it easier to decide whether there's any coffee in the pot.

We return the Win32::OLE RCX object to be used elsewhere in the script for calibration and polling.

The calibrate() function

The calibrate() function is called if the script is started without the -l option. This function simply prints a message, waits for the user to press Enter, and then goes into a a gentle loop, emitting whatever value was polled from the light sensor so the user can determine the pivot point:

sub calibrate { my $rcx = shift;

 print <<EOT; Calibration mode. Note the sensor values and decide
 on a 'pivot' value above which 'no coffee' is signified and below
 which 'coffee' is signified.

End the calibration mode with Ctrl-C.

Press Enter to start calibration... EOT

 while (1) { print $rcx->Poll(9, SENSOR), " "; sleep 1;


The output produced from this function can be seen in Figure 9-4.

The set_status() function

The set_status() function receives the latest light value as polled from the sensor and compares it with the pivot value. If the status defined in $new_status is different from the current status (in $current_status), then the current status is updated and returned; otherwise, undef is returned:

sub set_status { my $val = shift;

 my $new_status = $val < $opts{'l'} ? COFFEE : NOCOFFEE;
 if ($new_status != $current_status) { $current_status = $new_status;
 return $current_status;
 else { return undef;


If this function returns a status value, a new <presence/> element is generated and emitted by the script. Otherwise, there's no change ("the coffee's still there," or "there's still no coffee!") and nothing happens.

An RSS News Agent

While the Jabber clients available off the shelf are orientated toward receiving (and sending) messages from other people, the possibilities don't stop there, as is clear from the recipes we've seen already. In this recipe, we're going to build a Jabber component that retrieves news items from various sources on the Web and sends them on to Jabber users who have expressed an interest in receiving them. We're going to use the Web for our news sources, but they could just as easily be sources within a corporate intranet. The key thing is that the sources are available in a readily parseable format.

RSS (RDF[5] Site Summary or, alternatively, Really Simple Syndication) is an XML format used for describing the content of a web site, where that site typically contains news items, diary entries, event information, or generally anything that grows, item by item, over time. A classic application of RSS is to describe a news site such as JabberCentral (http://www.jabbercentral.org). JabberCentral's main page (see Figure 9-5) consists of a number of news items—in the "Recent News" section—about Jabber and its developer community. These items appear in reverse chronological order, and each one is succinct, sharing a common set of properties:

Each item has a title ("JabberCon Update 11:45am - Aug 20").
Short description
Each item contains a short piece of text describing the content and
context of the news story ("JabberCon Update - Monday Morning").
Link to main story
The short description should be enough to help the reader decide if he
wants to read the whole item. If he does, there's a link ("Read More")
to the news item itself.

{{Figure|title=JabberCentral's main page|image=0596002025-jab_0905.png</code> It is this collection of item-level properties that are summarized in an RSS file. The formality of the XML structure makes it a straightforward matter for:

  • Automating the retrieval of story summaries for inclusion in other sites (syndication)
  • Combining these items with items from other similar sources (aggregation)
  • Checking to see whether there is any new content (new items) since the last visit Example 9-8 shows what the RSS XML for JabberCentral's news items shown in Figure 9-5 looks like.

RSS source for JabberCentral

<?xml version="1.0" encoding="ISO-8859-1"?>

<!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN" "http://my.netscape.com/publish/formats/rss-0.91.dtd">

<rss version="0.91">

   <description> JabberCentral is the premiere Jabber end-user
   news and support site. Many Jabber developers are actively involved
   at JabberCentral to provide fresh and authoritative information for
   users. </description>
   <copyright>Copyright 2001, Aspect Networks</copyright>
   <item> <title>JabberCon Update 11:45am - Aug
   998329970</link> <description>JabberCon Update - Monday
   Morning</description> </item>
   <item> <title>Jabcast Promises Secure Jabber
   998061331</link> <description> Jabcast announces their
   intention to release security plugins with their line of products
   and services. </description> </item>
   ... (more items) ...


The structure is very straightforward. Each RSS file describes a channel, which is defined as follows:

Channel information
The channel header information includes the channel's title
(<title/>), short description
(<description/>), main URL (<link/>),
and so on. The channel in this case is JabberCentral.
Channel image
Often RSS information is rendered into HTML to provide a concise
"current index" summary of the channel it describes. An image can be
used in that summary rendering, and its definition is held in the
<image/> section of the file.
Channel items
The bulk of the RSS file content is made up of the individual
<item/> sections, each of which reflects an item on the
site that the channel represents. We can see in Example 9-8 that the
first <item/> tag:

<item> <title>JabberCon Update 11:45am - Aug 20</title> <link>http://www.jabbercentral.com/news/view.php?news_id=998329970 </link> <description>JabberCon Update - Monday Morning</description> </item>

describes the most recent news item shown on JabberCentral's main
page—"JabberCon Update 11:45am - Aug 20." Each of the news item
properties are contained within that <item/> tag: the
title (<title/>), short description
(<description/>), and link to main story
Channel interactive feature
There is a possibility for each channel to describe an interactive
feature on the site it represents; often this is a search engine
fronted by a text input field and Submit button. The interactive
feature section of an RSS file is used to describe how that mechanism
is to work (the name of the input field and the Submit button and the
URL to invoke when the button is clicked, for example). This is so
HTML renderings of the site can include the feature otherwise
available only on the original site.

{{Note|This interactive feature definition is not shown in the RSS example here.

</code> RSS information lends itself very well to various methods of viewing. There are custom "headline viewer" clients available—focused applications that allow you to select from a vast array of RSS sources and have links to items displayed on your desktop (so, yes, the personal newspaper—of sorts—is here!). There are also possibilities for having RSS items scroll by on your desktop control bar.

And then there's Jabber. As described in Section 5.4.1, the Jabber <message/> element can represent something that looks suspiciously like an RSS item. The message type "headline" defines a message that carries news headline information. In this case, the <message/> element itself is usually embellished with an extension, qualified by the jabber:x:oob namespace (described in Section 6.3.8). Example 9-9 shows what the element would look like if the first news item from the JabberCentral site were carried in a headline message.

A headline message carrying a JabberCentral news item

<message type='headline' to='dj@qmacro.dyndns.org'> <subject>JabberCon Update 11:45am - Aug 20</subject> <body>JabberCon Update - Monday Morning</body> <x xmlns='jabber:x:oob'> <url>http://www.jabbercentral.com/news/view.php?news_id=998329970& lt;/url> <desc>JabberCon Update - Monday Morning</desc> </x> </message>

The jabber:x:oob namespace carries the crucial parts of the RSS item. Clients, such as WinJab and Jarl, can understand this extension and display the content in a clickable list of headlines, each representing a single RSS item, similar to the headline viewer clients mentioned earlier.

Of course, we could send RSS items to clients in nonheadline type messages:

<message type='headline' to='dj@qmacro.dyndns.org'> <subject>JabberCon Update 11:45am - Aug 20</subject> <body> JabberCon Update - Monday Morning http://www.jabbercentral.com/news/view.php?news_id=998329970 </body> </x> </message>

where the complete item information is transmitted in a combination of the <subject/> and <body/> tags. This works, but the clients can only display the message, as with any other message. However, if we send formalized metadata, the value of the message content increases enormously. (Figure 9-8 shows Jarl displaying RSS-sourced news headlines.)

Distributing RSS-sourced headlines over Jabber to standard Jabber clients is a great combination of off-the-shelf technologies. In fact, we'll see in the next section that it's not just standard Jabber clients that fit the bill; we'll write a Jabber-based headline viewer to show that not all Jabber clients are, nor should they be, made equal.

Writing the News Agent

We're going to write an RSS news agent, which we'll simply call newsagent. The newsagent is a mechanism that checks predefined sources for new RSS items and sends (or pushes) them to people who are interested in receiving them. For the sake of simplicity, we'll define the list of RSS sources in newsagent itself. See Section 9.3.4 later in this chapter for details on how to further develop this script.

The newsagent script as a component

Until now, the examples we've used, such as cvsmsg, HostAlive, and keyassist (shown in Chapter 8), have all existed as Jabber clients. That is, they've performed a service while connected to the Jabber server via the JSM. There's nothing wrong with this. Indeed, it's more than just fine to build Jabber-based mechanisms using a Jabber client stub connection; that way, your script, through its identity—the user JID—can avail itself of all the IM-related functions that the JSM offers—presence, storage and forwarding of messages, and so on. Perhaps even more interesting is that the mechanism needs only an account, a username, and a password on a Jabber server to be part of the big connected picture.

However, we know from Chapter 4 that there are other entities that connect to Jabber to provide services. These entities are called components. You can look at components as philosophically less "transient" than their client-connected brethren and also closer to the Jabber server in terms of function and connection.

We know from Section 4.1.1 that there are various ways to connect a component: library load, STDIO, and TCP sockets. The first two dictate that the component will be located on the same host as the jabberd backbone to which it connects, although a Jabber server could consist of

a collection of jabberds running on separate hosts.    The TCP

sockets connection type uses a socket connection between the component and the jabberd backbone, over which streamed XML documents are exchanged (in the same way they are exchanged in a client connection). It allows us to run components on any host and connect them to a Jabber server running on another host if we wish. This approach is the most desirable because of the connection flexibility. But it's not just the flexibility that matters: because the component is abstracted away from the Jabber server core libraries, it's up to us to decide how the component should be written. All the component has to do to get the Jabber server to cooperate is to establish the socket connection as described in the component instance configuration, perform an authenticating handshake, and correctly exchange XML stream headers.

Let's review how a TCP socket-based component connects. We'll base the review on what we're actually going to have to do to get newsagent up and running.

First, we have to tell the Jabber server that it is to expect an incoming socket connection attempt, which it is to accept. We do this by defining a component instance definition (or "description"—see Section 4.2.1) for our component. We include this definition in the main Jabber server configuration file, usually called jabber.xml. Example 9-10 shows a component instance definition for the RSS news agent, known as rss.qmacro.dyndns.org.

A component instance definition for the RSS news agent

<service id='rss.qmacro.dyndns.org'> <accept> <ip>localhost</ip> <port>5999</port> <secret>secret</secret> </accept> </service>

The name of the host on which the main Jabber server is running is qmacro.dyndns.org; it just so happens that the plan is to run the RSS news agent component on the same host. We give it a unique name (rss.qmacro.dyndns.org) to enable the jabberd backbone, or hub, to distinguish it from other components and to be able to route elements to it.

An alternate way of writing the component instance definition is shown in Example 9-11. The difference is simply in the way we specify the name. In Example 9-10, we specified an id in the <service/> tag with the value rss.qmacro.dyndns.org. In the absence of any <host/> tag specification in the definition, this id value is used by the jabberd routing logic as the identification for the component when determining where elements addressed with that destination should be sent. In Example 9-11, we have an explicit <host/> specification that will be used instead; we simply identify the service with an id attribute value of rss. In this latter case, it doesn't really matter from an addressability point of view what we specify as the value for the id attribute.

An alternative instance definition for the RSS news agent

<service id='rss'> <host>rss.qmacro.dyndns.org</host> <accept> <ip>localhost</ip> <port>5999</port> <secret>secret</secret> </accept> </service>

The instance definition contains all the information the Jabber server needs. We can tell from the <accept/> tag that this definition describes a TCP sockets connection. The socket connection detail is held in the <ip/> and <port/> tags. In this case, as we're going to run the RSS News Agent component on the same host as the Jabber server itself, we might as well kill two related birds—performance and security—with one stone by specifying in the <ip/> tag:[6]

Connecting over the loopback device, as opposed to a real network
interface, will give us a slight performance boost.
Accepting only on the loopback device is a simple security measure
that leaves one less port open to the world. The
<secret/> tag holds the secret that the connecting
component must present in the authentication handshake. How the secret
is specified is described later on in this section.

Now let's look at the component's view of things. It will need to establish a socket connection to Once that connection has been established, jabberd will be expecting it to announce itself by sending its XML document stream header. Example 9-12 shows a typical stream header that the component will need to send.

The RSS component's stream header

SEND: <?xml version='1.0'?> <stream:stream xmlns='jabber:component:accept' xmlns:stream='http://etherx.jabber.org/streams' to='localhost'>

This matches the description of a Jabber XML stream header (also known as a stream "root" as it's the root tag of the XML document) from Section 5.3. The namespace that is specified as the one qualifying the content of the stream is jabber:component:accept. This namespace matches the component connection method (TCP sockets) and the significant tag name in the component instance definition (). Likewise, the namespace jabber:component:exec matches the STDIO component connection method and the significant tag name in its component instance definition format: ()—see Section The value specified in the to attribute matches the hostname specified in the configuration's <ip/> tag.

After receiving a valid stream header, jabberd responds with a similar root to head up its own XML document stream going in the opposite direction (from server to component). A typical response to the header (Example 9-12) received from the server by the component is shown in Example 9-13.

The server's stream header reply

RECV: <?xml version='1.0'?> <stream:stream xmlns:stream='http://etherx.jabber.org/streams' id='3B8E3540' xmlns='jabber:component:accept' from='rss'>

The stream header sent in response shows that the server is confirming the component instance's identification as rss. This reflects whatever was specified in the <service/> tag's id attribute of the component instance definition. Here, the value of the id attribute was rss as in Example 9-11.

It also contains an ID for the component instance itself (id="3B8E3540'). This ID is a random string shared between both connecting parties; the value is used in the next stage of the connection attempt—the authenticating handshake.

The digest authentication method for clients connecting to the JSM is described in Section This method uses a similar shared random string. On receipt of the server's stream header, the component takes the ID and prepends it onto the secret that it must authenticate itself with. It then creates a NIST SHA-1 message digest (in a hexadecimal format) of that value:

SHA1_HEX(ID+SECRET) After the digest is created, it is sent in a <handshake/> element as the first XML fragment following the root:

SEND: <handshake id="1">14d437033d7735f893d509c002194be1c69dc500</handshake>

On receipt of this authentication request, jabberd combines the ID value with the value from the <secret/> tag in the component instance definition and performs the same digest algorithm. If the digests match, the component is deemed to have authenticated itself correctly, and it is then sent back an empty <handshake/> tag in confirmation:

<handshake/> The component may commence sending (and being sent) elements.

If the component sends an invalid handshake value—the secret may be wrong or the digest may not have been calculated correctly—the connection is closed: jabberd sends a stream error, ending the conversation:

RECV: <stream:error>Invalid handshake</stream:error>

Working out who gets what newsfeeds

Definitions of the RSS sources are held within the newsagent itself, but there's no reference to who might want to receive new items from which sources. We need a way for the component to accept requests, from users, that say things like:

"I'd like to have pointers to new items from Slashdot sent to me, please."
"I'd also like pointers to new items on Jon Udell's site, please."
or even:
"Whoa, information overflow! Stop all my feeds!"
There's a common theme that binds together components such

as the Jabber User Directory (JUD), and the transports to other IM systems such as Yahoo! and ICQ. This theme is known as registration. We've seen this before in the form of user registration, described in Section 6.5.2. This is the process of creating a new account with the JSM. Registration with a service such as the JUD or an IM transport, however, follows a similar process, and both types of registration have one thing in common: the jabber:iq:register namespace.

The jabber:iq:register namespace is used to qualify the exchange of information during a registration process. The registration process to create a new user account with the JSM uses the jabber:iq:register namespace to qualify registration data exchanged. The registration process with the JSM to modify the account details (name, email address, and so on) also uses jabber:iq:register to qualify the account amendment data exchanged. Both types of registration requests are addressed to the JSM. The difference, which allows the JSM to distinguish between what is being requested, is that no session is active on the stream between client and server in the new user registration process, whereas in the account amendment process, a session is active. This is also mentioned in Section 7.

The jabber:iq:register namespace is described in Section 6.2.11 in Chapter 6. It shows us how a typical conversation between requester and responder takes place:

  1. The client sends an IQ-get: "How do I register?" The component
  2. sends an IQ-result: "Here's how:

follow these instructions to fill in these fields."

  1. The client then sends an IQ-set with values in the fields: "OK,
  2. here's my registration request." To which the component responds,
  3. with another IQ-result: "Looks fine. Your registration details have
  4. been stored."

It's clear that this sort of model will lend itself well to the process of allowing users to make requests to receive pointers to new items from RSS sources chosen from a list. Example 9-14 shows this conversational model in Jabber XML. There are many fields that can be used in a registration request; the description in Section 6.2.11 in Chapter 6 includes a few of these—<name/>, <first/>, <last/>, and <email/>—but there are more. We'll take the <text/> field to accept the name of an RSS source when a user attempts to register his interest to receive pointers to new items from that source. The conversational model is shown from the component's perspective.

A registration conversation for RSS sources "How do I register?"

RECV: <iq type='get' id='JCOM_3' to='rss.qmacro.dyndns.org' from='dj@qmacro.dyndns.org/basement'> <query xmlns='jabber:iq:register'/> </iq>

"Here's how:"

SEND: <iq id='JCOM_3' type='result' to='dj@qmacro.dyndns.org/basement' from='rss.qmacro.dyndns.org'> <query xmlns='jabber:iq:register'> <instructions> Choose an RSS source from: Slashdot, Jon Udell[, ...] </instructions> <text/> </query> </iq>

"OK, here's my registration request:"

RECV: <iq type='set' id='JCOM_5' to='rss.qmacro.dyndns.org' from='dj@qmacro.dyndns.org/basement'> <query xmlns='jabber:iq:register'> <text>Slashdot</text> </query> </iq>

"Looks fine. Your registration details have been stored."

SEND: <iq id='JCOM_5' type='result' to='dj@qmacro.dyndns.org/basement' from='rss.qmacro.dyndns.org'> <query xmlns='jabber:iq:register'> <text>Slashdot</text> </query> </iq>

After some time passes...

"Whoa, information overflow! Stop all my feeds!"

RECV: <iq id='JCOM_11' to='rss.qmacro.dyndns.org' type='set' from='dj@qmacro.dyndns.org/basement'> <query xmlns='jabber:iq:register'> <remove/> </query> </iq>

"OK, you've been removed. All feeds stopped."

SEND: <iq id='JCOM_11' to='dj@qmacro.dyndns.org/basement' type='result' from='rss.qmacro.dyndns.org'> <query xmlns='jabber:iq:register'> <remove/> </query> </iq>

A lightweight persistent storage system is used for the user/source registrations—DataBase Manager (DBM)—to keep the script fairly simple.

The bigger question here is: how will the user know he can register to a particular RSS feed? Or more importantly: how can he determine if the RSS News Agent system exists? Most clients, having connected to the server and established a session with the JSM, make a request for a list of agents (old terminology) or services (new terminology) available from the Jabber server with the following IQ-get method:

SEND: <iq id="wjAgents" to="qmacro.dyndns.org" type="get"> <query xmlns="jabber:iq:agents"/> </iq>

The response to the request looks like this:

RECV: <iq id='wjAgents' to='dj@qmacro.dyndns.org/basement' type='result' from='qmacro.dyndns.org'> <query xmlns='jabber:iq:agents'> <agent jid='conf.qmacro.dyndns.org'> <name>Public Chatrooms</name> <service>public</service> <groupchat/> </agent> <agent jid='users.jabber.org'> <name>Jabber User Directory</name> <service>jud</service> <search/> <register/> </agent> </query> </iq>

which reflects the contents of the <browse/> section in the JSM configuration as shown in Example 9-15.

The JSM configuration's browse section

<browse> <conference type="public" jid="conf.qmacro.dyndns.org" name="Public Chatrooms"/> <service type="jud" jid="users.jabber.org" name="Jabber User Directory"> <ns>jabber:iq:search</ns> <ns>jabber:iq:register</ns> </service> </browse>

If we add a stanza that describes the component for the RSS News Agent to the <browse/> section of the JSM configuration:

<service type="rss" jid="rss.qmacro.dyndns.org" name="RSS News Agent"> <ns>jabber:iq:register</ns> </service>

we get an extra section in the jabber:iq:agents response from the server:

<agent jid='rss.qmacro.dyndns.org'> <name>RSS News Agent</name> <service>rss</service> <register/> </agent>

The client-side effect of the agents response is exactly what we're looking for. Figure 9-6 shows WinJab's Agents menu displaying a summary of what it received in response to its jabber:iq:agents query.

WinJab's Agents menu

</code> We can see that the stanza for the RSS news agent was present in the <browse/> section and the component is faithfully displayed in the agent list, along with Public Chatrooms and Jabber User Directory. In the main window of the screenshot we can see the Supported Namespaces list; it contains the namespace that we specified in the stanza. By specifying:

<ns>jabber:iq:register</ns> we're effectively telling the client that the component will support a registration conversation.

But that's not all—we've advertised the RSS news agent in the <browse/> section of the configuration for the JSM on the Jabber server running on qmacro.dyndns.org. That's why we got the information about the RSS news agent when we connected as user dj to qmacro.dyndns.org (see the window's titlebar in Figure 9-6). You may have noticed something odd about the definition of the other two agents, or services, in the <browse/> section earlier or in the corresponding jabber:iq:agents IQ response. Let's take a look at this response again, this time with the extra detail about the component:

RECV: <iq id='wjAgents' to='dj@qmacro.dyndns.org/basement' type='result' from='qmacro.dyndns.org'> <query xmlns='jabber:iq:agents'> <agent jid='rss.qmacro.dyndns.org'> <name>RSS News Agent</name> <service>rss</service> <register/> </agent> <agent jid='conf.qmacro.dyndns.org'> <name>Public Chatrooms</name> <service>public</service> <groupchat/> </agent> <agent jid='users.jabber.org'> <name>Jabber User Directory</name> <service>jud</service> <search/> <register/> </agent> </query> </iq>

While the jid attribute values for the RSS news agent and Public Chatroom agents show that they are components connected to the Jabber server (i.e., they both have JIDs in the qmacro.dyndns.org "space," and so are connected to the Jabber server running at qmacro.dyndns.org), the jid attribute for the Jabber User Directory points to a name in the jabber.org "space"! This is a side effect of the power and foresight of Jabber's architectural design. If we connect a component—whether it's one we've built ourselves or one we've downloaded from http://download.jabber.org—we can give it an internal or external identity when we describe it in the jabber.xml configuration.

Example 9-8 and Example 9-9 show two examples of an instance definition for the RSS news agent component. Both specify potentially external identities. If the hostname rss.qmacro.dyndns.org is a valid and resolvable hostname, the component can be reached from anywhere, not just from within the Jabber server to which it is connected. If the hostname wasn't resolvable by the outside world, by having a simple name such as rss, it could be reached only from the Jabber server to which it was connected.

So let's say rss.qmacro.dyndns.orgis a valid and resolvable hostname. If your client is connected to a Jabber server running on yourserver.org, this is what would happen if you were to send a registration request (an <iq/> element with a query qualified by the jabber:iq:register namespace) addressed to rss.qmacro.dyndns.org:

Packet reaches JSM on yourserver.org.
You send the IQ from your client, which is connected to your Jabber
server's JSM. This is where the packet first arrives.
Internal routing tables consulted.
This is how yourserver.org's jabberd looks in its list of
internally registered destinations and doesn't find
rss.qmacro.dyndns.org in there.
Name resolved and routing established.
yourserver.org's dnsrv (Hostname Resolution) service
is used to resolve the rss.qmacro.dyndns.org's address. Then,
according to dnsrv's instance configuration (specifically the
<resend>s2s</resend> part—see Section 4.9), the IQ is
routed on to the s2s (Server to Server) component.
Server to server connection established.
yourserver.org establishes a connection to
qmacro.dyndns.org via s2s and sends the IQ across
the connection.
Packet arrives at the RSS News Agent component on qmacro.dyndns.org.
jabberd on qmacro.dyndns.org routes the packet correctly to
rss.qmacro.dyndns.org. So, what do we learn from this? As
exemplified by the reference to the JUD running at
users.jabber.org that comes predefined in the standard
jabber.xml with the 1.4.1 version of the Jabber server, you can
specify references to services, components, on other Jabber
servers. If you take this RSS News Agent script and run it against
your own Jabber server, there's no reason why you can't share its
services with your friends who run their own Jabber servers.

The key is not the reference in the <browse/> section; it is the resolvability of component names as hostnames and the ability of Jabber servers to route packets to each other. The stanza in <browse/> just makes it easier for the clients to automatically know about and be able to interact with services in general. Even if a service offered by a public component that wasn't described in the result of a jabber:iq:agents query, it wouldn't stop you from reaching it. The agent browser is another client, Gabber (shown in Figure 9-7), which is a GTK-based Jabber client that allows you to specify a Jabber server name, in the Server to Browse field, so that you can direct the jabber:iq:agents queries to whatever server you want.

Viewing agents on different Jabber servers with Gabber

</code> A good example of the distinction between the definition of a component within a <browse/> section and that component's reachability is the version query shown in Example 9-16. Regardless of whether the conference component at gnu.mine.nu was listed in the <browse/> section of qmacro.dyndns.org's JSM, the user dj was able to make a version query by specifying the component's address (a valid and resolvable hostname) in the IQ-get's to attribute.

A Conferencing component responds to a version query

SEND: <iq type='get' to='conf.gnu.mine.nu'> <query xmlns='jabber:iq:version'/> </iq>

RECV: <iq type='result' to='dj@qmacro.dyndns.org/study' from='conf.gnu.mine.nu'> <query xmlns='jabber:iq:version'> <name>conference</name> <version>0.4</version> <os>Linux 2.2.13</os> </query> </iq>

Polling the RSS sources

Next, we need some way of "interrupting" the process of checking for incoming elements and dispatching them to the callbacks, while we retrieve the RSS data and check for new items. Since we're writing this component in Perl, we could use the alarm() feature to set an alarm and have a subroutine invoked, to poll the RSS sources, when the alarm goes off. However, this recipe uses the Jabber::Connection library, which negates the needs for an external alarm. Instead, we need to take the following steps each time we want to poll the RSS sources:

  1. Try to retrieve the source from the URL. Attempt to parse the source's
  2. XML. Go through the items, until we come across one we've seen before;
  3. the ones we go through until then are deemed to be new. (We need a
  4. special case the first time around, so that we don't flood everyone
  5. with every item of a source the first time it is retrieved.) For new
  6. items, look in the registration database for the users that have
  7. registered for that source, construct a headline message like the one
  8. shown in Example 9-7, and send it to those users. Remember the first
  9. of the new items, so that we don't go beyond it next time.

Other Differences Between Client and Component Programming

There are many differences between programming a component and programming a client. We're already aware of many of the major ones, described earlier in Section There are, however, also more subtle differences that we need to bear in mind.

Components, unlike clients, do not connect to the JSM. They connect as a peer of the JSM. This means not only that they cannot partake of IM features made available by JSM's modules (see Section 4.4.4 for a list of these modules) but also that they must do more for themselves. This isn't as bad as it seems. Take store and forward, for example, a feature provided by the JSM's mod_offline module. While a message sent to a component won't be stored and forwarded if that component is not connected, a message sent from a component to a client will get stored and forwarded if the client is offline, because the message will be routed to the JSM (as specified by the [hostname] in the address), which can decide what action to take. Messages can be passed directly to the client if the user is online or can be stored and forwarded later when they're back online.

When constructing an element as a client, we should not specify a from attribute before it is sent; this is added by the JSM as it arrives to prevent JID spoofing. If a component does not connect through the JSM, no "from-stamping" takes place; the component itself must stamp the element with a from attribute.

The addressing of a component is also slightly different. Client addresses reflect the fact that they're connected to the JSM, always having the form of (with the resource being optional):

[user]@[hostname]/[resource] While the basic address form of a component is simply:

[hostname] This doesn't mean to say that the address of a component cannot have a user or a resource part. It's just that all elements addressed to:

anything@[hostname]/anything will be routed by jabberd to the component. This means the component can play multiple roles and have many personalities. We'll see an example of this in the script, where we construct an "artificial" [user]@[hostname] address for the from attribute of a <message/> element, to convey information.

The component will respond to IQ queries in the jabber:iq:register namespace. It is customary, although by no means mandatory, for components to respond to queries in a set of common IQ namespaces. We see that both the JUD and Conferencing components, for example, respond to IQ queries in the jabber:iq:time and jabber:iq:version namespaces. Example 9-16 shows a typical version query on a Conferencing component. This responsiveness is simply to provide a basic level of administrative information. We want the component to conform to the customs, so we'll make sure it also responds to queries in these namespaces.

The RSS News Agent Script

The RSS news agent script (newsagent) is written in Perl and is shown here in Example 9-17.

The RSS news agent script, written in Perl

my $NAME = 'RSS News Agent'; my $ID = 'rss.qmacro.dyndns.org'; my $VERSION = '0.1'; my $reg_file = 'registrations'; my %reg;

my %cache;

my %sources = ( 'jonudell' => 'http://udell.roninhouse.com/udell.rdf', 'slashdot' => 'http://slashdot.org/slashdot.rdf',

 # etc ...


tie (%reg, 'MLDBM', $reg_file) or die "Cannot tie to $reg_file: $!\n";

my $c = new Jabber::Connection( server => 'localhost:5999', localname => $ID, ns => 'jabber:component:accept', );

unless ($c->connect()) { die "oops: ".$c->lastError; }

$SIG{HUP} = $SIG{KILL} = $SIG{TERM} = $SIG{INT} = \&cleanup;

debug("registering RSS beat"); $c->register_beat(1800, \&rss);

debug("registering IQ handlers"); $c->register_handler('iq',\&iq_register); $c->register_handler('iq',\&iq_version); $c->register_handler('iq',\&iq_browse); $c->register_handler('iq',\&iq_notimpl);



sub iq_register { my $node = shift; debug("[iq_register]"); return unless my $query = $node->getTag(, NS_REGISTER); debug("--> registration request");

 # Reg query
 if ($node->attr('type') eq IQ_GET) { $node = toFrom($node);
 $node->attr('type', IQ_RESULT); my $instructions = "Choose an RSS
 source from: ".join(", ", keys %sources);
 $query->insertTag('text'); $c->send($node);
 # Reg request
 if ($node->attr('type') eq IQ_SET) {
   # Strip JID to user@host
   my $jid = stripJID($node->attr('from'));
   $node = toFrom($node); my $source;
   # Could be an unregister
   if ($query->getTag('remove')) { delete $reg{$jid};
   $node->attr('type', IQ_RESULT);
   # Otherwise it's a registration for a source
   elsif ($source = $query->getTag('text')->data and
   exists($sources{$source})) { my $element = $reg{$jid};
   $element->{$source} = 1; $reg{$jid} = $element;
   $node->attr('type', IQ_RESULT);
   else { $node->attr('type', IQ_ERROR); my $error =
   $node->insertTag('error'); $error->attr('code', '405');
   $error->data('Not Allowed');
 return r_HANDLED;


sub iq_version { my $node = shift; debug("[iq_version]"); return unless my $query = $node->getTag(, NS_VERSION) and $node->attr('type', IQ_GET); debug("--> version request"); $node = toFrom($node); $node->attr('code', IQ_RESULT); $query->insertTag('name')->data($NAME); $query->insertTag('version')->data($VERSION); $query->insertTag('os')->data(`uname -sr`); $c->send($node); return r_HANDLED; }

sub iq_browse { my $node = shift; debug("[iq_browse]"); return unless my $query = $node->getTag(, NS_BROWSE) and $node->attr('type', IQ_GET); debug("--> browse request"); $node = toFrom($node); $node->attr('type', IQ_RESULT); my $rss = $query->insertTag('service'); $rss->attr('type', 'rss'); $rss->attr('jid', $ID); $rss->attr('name', $NAME); $rss->insertTag('ns')->data(NS_REGISTER); $c->send($node); return r_HANDLED; }

sub iq_notimpl { my $node = shift; $node = toFrom($node); $node->attr('type', IQ_ERROR); my $error = $node->insertTag('error'); $error->attr('code', '501'); $error->data('Not Implemented'); $c->send($node); return r_HANDLED; }

sub rss { debug("[rss]");

 # Create NodeFactory
 my $nf = new Jabber::NodeFactory;
 # Go through each of the RSS sources
 foreach my $source (keys %sources) {
   # Retrieve attempt
   my $data = get($sources{$source});
   # Didn't get it? Next one
   unless (defined($data)) { debug("Cannot retrieve $source"); next;
   # Parse the RSS
   my $rss = XML::RSS->new(); eval { $rss->parse($data) };
   if ($@) { debug("Problems parsing $source"); next;
   my @items = @{$rss->{items;
   # Check new items
   debug("$source: looking for new items"); foreach my $item (@items) {
     # Stop checking if we get to items already seen
     last if exists $cache{$source} and $cache{$source} eq
     debug("$source: new item $item->{title}");
     # Create a headline message
     my $msg = $nf->newNode('message');
     $msg->attr('type', 'headline'); $msg->attr('from', join('@',
     $source, $ID));
     my $xoob = $msg->insertTag('x', NS_XOOB);
     # Deliver to all that want it
     foreach my $jid (keys %reg) {
       my $registration = $reg{$jid};
       if (exists($registration->{$source})) { $msg->attr('to',
       $jid); debug("Sending to $jid"); $c->send($msg);
     # Prevent all items counted as new the first time around
     last unless exists($cache{$source});
   # Remember the latest new item
   $cache{$source} = $items[0]->{link};


sub cleanup { debug("Cleaning up"); untie %reg; $c->disconnect; exit; }

sub toFrom { my $node = shift; my $to = $node->attr('to'); $node->attr('to', $node->attr('from')); $node->attr('from', $to); return $node; }

sub stripJID { my $JID = shift; $JID =~ s|/.*$||; return $JID; }

sub debug { print STDERR "debug: ", @_, "\n"; }


Reviewing the RSS News Agent Script Step by Step

Now that we know what the newsagent script looks like in its entirety, let's review the script piece by piece.

Module declarations and variable definitions

We start out by using the Jabber::Connection library, which is defined as follows:

use strict; use Jabber::Connection; use Jabber::NodeFactory; use Jabber::NS qw(:all); use MLDBM 'DB_File'; use LWP::Simple; use XML::RSS;

The Jabber::Connection library consists of the following three modules:

This module is used to manage the connection to the server and parses
and dispatches incoming elements.
This module allows us to manipulate elements, which are generically
called nodes.
The last module provides us with a list of constants that reflect
namespaces and other common strings used in Jabber server, client, and
component programming. Next we need a way of storing the registration
information between invocations of the component script, and for that
we'll use the Multi-Level Database Manager module, MLDBM.
MLDBM is a useful wrapper that can be placed around the
DB_File module. DB_File provides access to Berlekey
Database (Berkeley DB, at http://www.sleepycat.com) facilities using
the tie() function. While you can't store references (i.e.,
complex data structures) via DB_File, you can with the
MLDBM wrapper.

We will use the LWP::Simple module to grab the RSS sources by URL and the XML::RSS module to parse those sources once retrieved:

my $NAME = 'RSS News Agent'; my $ID = 'rss.qmacro.dyndns.org'; my $VERSION = '0.1'; my $reg_file = 'registrations'; my %reg;

my %cache;

my %sources = ( 'jonudell' => 'http://udell.roninhouse.com/udell.rdf', 'slashdot' => 'http://slashdot.org/slashdot.rdf',

 # etc ...


We start by declaring a few variables. We will see later in the script that $NAME, $ID, and $VERSION will be used to reflect information in response to IQ queries. The variable $reg_file defines the name of the DB file to which we will be tying the registration hash, %reg. %cache is the RSS item cache, which holds items we've already seen so we know when we've come to the end of the new items in a particular source.

We define the RSS sources in %sources. These can be defined differently, perhaps outside of the script, but this gives you a general idea of how these should look. There are a couple of examples here; add your own favorite channels to taste.

To make persistent any data we store in the %reg hash, we can use the magic of the tie() function:

tie (%reg, 'MLDBM', $reg_file) or die "Cannot tie to $reg_file: $!\n";

It works by binding (tie'ing) the operations on the hash (add, delete, and so on) to Berlekey DB operations, using the MLDBM module to stringify (and reconstruct) complex data structures so they can be stored and retrieved.

Connecting to the Jabber server

Now we're ready to connect to the Jabber server as a component. Despite what's involved (described in Section it's very easy to make the connection to the Jabber server using a library such as Jabber::Connection:

my $c = new Jabber::Connection( server => 'localhost:5999', localname => $ID, ns => 'jabber:component:accept', );

We construct a Jabber::Connection object, specifying the details of the connection we wish to make. The server argument is used to specify the hostname, and optionally the port, of the Jabber server to which we wish to connect. In the case of a component, we must always specify the port (which is 5999 in this example, as shown in Example 9-8). The same constructor can be used to create a client connection to the Jabber server, in which case a default port of 5222 (the standard port for client connections) is assumed if none is explicitly specified. The localname argument is used to specify the component's name, which in this case is rss.qmacro.dyndns.org. In the same way that a default port of 5222 is assumed if none is specified, a default stream namespace of jabber:client is assumed if no ns argument is specified. Since we want to connect as a component using the TCP sockets connection method, we must specify the appropriate namespace: jabber:component:accept.

This constructor call results in a stream header being prepared; it looks like the one shown in Example 9-12.

The actual connection attempt, including the sending of the component's stream header, is done by calling the connect() method on the connection object in $c:

unless ($c->connect()) { die "oops: ".$c->lastError; }

This will return a true value if the connect succeeded (success is measured in whether the socket connection was established and whether the Jabber server sent a stream header in response). If it didn't succeed, we can retrieve details of what happened using the lastError() method.

We're connected. Before performing the authenticating handshake, we're going to do a bit of preparation:

$SIG{HUP} = $SIG{KILL} = $SIG{TERM} = $SIG{INT} = \&cleanup;

The idea is that the component will be run and stopped only in certain circumstances. If it is stopped, we want to clean things up before the script ends. Most importantly, we need to make sure the registration data is safe, but also we want to play nicely with the server and gracefully disconnect. This is done in the cleanup() function.

Preparing the RSS event function and element handlers

Jabber::Connection offers a simple way of having a function execute at regular intervals. It avoids the need for setting and resetting alarms. It is the register_beat() function:

debug("registering RSS beat"); $c->register_beat(1800, \&rss);

Calling the register_beat() method takes two arguments. The first argument represents the interval, in seconds. The second is a reference to the function that should be invoked at each interval. Here, we're saying we want the rss() function called every 30 minutes (1800 seconds).

Most of the traffic relating to the component will be the headline messages emanating from it. However, we are expecting incoming IQ elements, particularly for registration in the jabber:iq:register namespace. We've also already mentioned that it's customary for components t

Personal tools