Digital Humanities: Reflections on distant reading and why the ability to fly hasn’t stopped us walking.

One of the themes explored in the Digital Humanities is often “what exactly do we mean by Digital Humanities” because everyone asks and there are any number of self-described skeptics who seem to have an inability to add any new categories to their knowledge hierarchies. We’re studying the intersection of traditional computing and humanities so we’re asking the old question of “where does the desert end” which is only answered locally and specifically, rather than globally and generally.  But a major fear for Humanists that came up during the week I was in Maryland was the threat of a colonising external force that would fundamentally alter what Humanists did until it was unrecognisable. I’m going to talk briefly about my view of digital humanities as a parallel augmentation, rather than a displacing colonisation.

Many areas of Humanities use the notion of close reading, where the text is carefully read and interpreted as part of a sustained effort. While this is exemplary for extracting themes and really getting into the work it doesn’t scale up well. We keep producing things to read and there is a limit as to how many things you can close read. This is where distant reading can come in, because it scans works thematically and syntactically, and provides an aggregate or abstraction to the reader. This is scalable and fast, because we can computerise it, but it risks inaccuracy, shallowness and is guaranteed to have the bias of the analysis software.

Let me step back and talk about travel for a moment. We started (well, by we, I mean bipedal humans) moving around on our feet. Then we did things with animals – in a vehicular sense – then the wheel, then lots of wheels, plus animals, plus betting – and that’s how we got the Colosseum. At some point, we stopped trying to put petrol into animals (who kept exploding) and tried it in cars intend. Suddenly we could zoom around, which widened our stride, but had the downside of enabling Italian Futurism at the start of the 20th century which led to all sorts of odd things and the declaration of war as the “great hygiene” until a lot of them died in the Great War – seriously, Marinetti, what were you thinking? (As a side note, the Futurist Cookbook is worth reading because it’s very Heston Blumenthal, just 90 years beforehand.)

Speed! Car! Vroom! Burn libraries! Vroom! We are the Early Supergrass of Art Philosophy!

Speed! Car! Vroom! Burn libraries! Vroom! We are the Early Supergrass of Art Philosophy!

Then we developed planes and the journey that took months on foot, weeks on animal and days by car, could take hours. But we never stopped walking, although we could now use our more advanced techniques to walk in new places and ultimately go further.

I feel exactly the same about close and distant reading. There are now (hooray) too many books on most subjects for any person to read in their life, let alone in a span to allow detailed analysis in a timely fashion. But this doesn’t mean we have to stop close reading. It means that we can look into topics and areas, refine our distant reading and visualisation, and then drill down once we’ve landed somewhere. Better still, distant reading allows us to link areas of close reading that may not be apparently connected – we can fly to a new place to explore that will develop the knowledge we already have.

Personally, I’d love it if the Humanities came and did a bit of colonisation in Computer Science, but I can completely understand why the reverse is culturally confronting. And I can also understand the many trad CS people who would also feel threatened by a counter-colonisation – although I probably don’t agree with their reasoning.

Going to a course like this is always good for my thinking as it requires me to switch gears and lens to get things done. I strongly recommend stepping out of the comfort zone of your own discipline when you can as it gives you extra knowledge and some valuable perspective.


Humanities Intensive Learning + Teaching, Day 5, Maryland Institute for Technology in the Humanities, #hilt2014

Sorry for the delay in completing this – it has been crazy and I prefer to write directly into the live blog, which means a network feed, as I just find it easier to put things together this way. (It’s only been a week (ish) anyway.)

Today (well, then), we looked at modularity  and how we could break networks into separate communities. This is important because it helps us to be able to see structure. The human eye is a great tool but it often needs help and modularity is a good way to do this. We have strong ties between components in directed networks (unidirectional) because we have taken the time to say that the link goes this way. We have weak ties in undirected networks because there is no solidity to the association form one side to the other. The more connected something is, the more strongly tied it is to the things it connects to so, when we hunt for communities, we want to take away the least number of connections to produce the largest number of connected communities.

Let’s look at the Hamlet graph (after Moretti) with some colouring added for the various communities and all the connections left in.

It's hard to see any separation here although the communities can be seen.

It’s hard to see any separation here although the communities can be seen.

Now let’s take out the intra-community links.

Bingo! Poor old Hamlet.

Bingo! Poor old Hamlet.

Now we can suddenly see, in terms of dialogue exchanges and interaction, how isolated Hamlet actually is. The members of the court are one community. The invaders are another. Horatio, the proxy for the Danish state, is deeply involved in statehood. Hamlet, however, is more heavily connected to the characters who help with his dawning realisation that something has gone awry. No wonder he goes mad, he’s the Nigel Nofriends of Medieval Denmark, with only the insane Clown Posse and a group of second-rate actors to keep him company.

After this we spent some extensive time working on producing prettier looking graphs with Gephi and using more advanced algorithms to help us to represent what’s going on. We looked at the Twitter data for the conference and came up with this:

Twitter data using the #HILT2014 tag

Twitter data using the #HILT2014 tag

This shows the separate groups of Twitter user communities who used the HILT2014 hashtag. My small cluster is the small purple arrow bottomish left. We’ve also ranked the size of the data labels based on the significance of that person in the Twitter information chain, using the PageRank algorithm. So, Jim McGrath wins Twitter for HILT 2014! We also look and how important certain links were in the network by looking at edge betweenness to determine which links are used by the most shortest paths and then use this to identify important connected components.

Lots of good hands-on stuff and a very interesting course! I learned a great deal and have already started using it elsewhere.


Humanities Intensive Learning + Teaching, Day 4, Maryland Institute for Technology in the Humanities, #hilt2014

Or, the alternative title, “The Play Formerly Known as Hamlet”. Today had a lot of fascination discussion where the bipartite nature of our class network became apparent, in terms of the the majority of the class who were Digital Humanists and really understood a detailed close reading of Hamlet – and your humble narrator.

Today we talked about modularity, which allows you to break your network up into separate modules, which makes a lot more sense when you call those modules things like communities, groups or clusters. Can your network be split up into areas where some groups are more connected to each other than they are to adjacent nodes? By doing this, we are trying to expose structural features of the network and, in particular, unexpected aspects of network structure – is there some set of nodes that have a role in the network that we don’t understand? If we talk about this in terms of people, if our social network is made up of small communities with connections between the communities, then we would expect gossip to spread more rapidly inside a community than between the communities. Simple, eh? The approach we take depends upon comparing the structure we have with a model of a random network using the same number of links.

Once we’ve done this, we can use this in a tool, such as Gephi, to clearly illustrate the groups. Here’s a grumpy Dane in illustration.

Hamlet and the various posses of Hamlet. Note the Insane Clown Posse.

Hamlet and the various posses of Hamlet. Note the Insane Clown Posse.

I’ve coloured and tagged the network to show the key players in Hamlet, based on Moretti’s analysis of Hamlet, which attached unweighted connections between participants who had direct conversations. Now, we’ve made the size of the nodes reflect how relatively important (in terms of betweenness, the number of paths that must go through this person). If we did this in terms of PageRank, another measure of the relative importance of nodes, based on their connectivity, the nodes in green would jump up in size. But what you should note is that Hamlet and Horatio are roughly the same size and, while Hamlet is much more connected to everyone (quelle surprise, given the play’s named after him), the only thing that we lose if Hamlet disappears is that we no longer can hear from the Insane Clown Posse, Lucianus and the Lord. In purely connected terms he doesn’t appear to be that important. We’d obviously lose a lot of the text if he disappeared but how important is Hamlet in Hamlet?

This led to a lot of discussion in class as to the validity of the original Moretti pamphlet, much of which is core to the entire discussion of Digital Humanities. How valid is any conclusion from a network model such as this when the innate nature of the capture process (to provide the capta) may completely remove the ability to draw certain conclusions? The class discussed the impact on perceived and invisible observers, who strictly don’t have dialogue with each other but potentially have an impact upon other characters and their narrative interactions. (The Moretti pamphlet may be found here.) Moretti had a lot to say about Horatio as a proxy for the state and was very interested in the closeness of Hamlet to everyone else, but (later on) we ran some random network experiments and it turned out to be the type of connections in the network (the clustering coefficient) that was more interesting than the closeness.

We then moved on to a discussion of a number of useful metric for networks, including the clustering effects that tend to  indicate intention in the system we’re studying (there’s no real reason for a random network to form a small number of unique clusters unless you tune for it.) We also discussed the Small Worlds of Watts and Strogatz (1998) where you have cliques of nodes (tightly connected clusters) linked together by a smaller number of links, characterised by a power law distribution and a higher clustering coefficient (very basically).

We generated some random graphs to see if we got the structure we saw in Hamlet as noted earlier. Why? Because looking at Hamlet and drawing out information from the structure only has validity if (1) the model is accurate and (2) the situation couldn’t have easily arisen by chance. As noted, we generated a random graph for the same number of nodes and it had a similar average path length and identical diameter – but very different clustering coefficients! So it turns out that Hamlet wasn’t written by a monkey after all.

Don't laugh, this is Darwin's skull!

Don’t laugh, this is Darwin’s skull!

The final part of the session was on dynamic networks. This is the study of networks that change over time and we worked with some data that showed people’s association with an area over time. We could look at this in terms of change in time or in terms of progress through a corpus (chapters are as useful as dates here). What you want is a start date and an end date for the feautres in your network – when should the feature be there and when should it not be there anymore? It turns out that Gephi is quite convenient here, because you can merge a start and end time and end up with a time interval. Not bad, Gephi, not bad. Now we can see when things are active – great if you’re looking to see when students are active in forums or activities, for example. Here’s an example of the difference in the test network between the first and second halves of 2009, with all nodes of zero degree (nodes with no links) removed.

First half of 2009

First half of 2009

Second half of 2009

Second half of 2009

We then played around a lot with a vide variety of animations over time, including ongoing calculations, colour and shape alterations. Basically, it was rather hallucinatory by the end but that may be the Dr Pepper talking. We even got to see the sparklines (Zelchenko/Tufte) for some extra visualisation goodness!

This is one of those classic separations between the things we visualise for ourselves to help us to work out what’s interesting about a dataset and what we would visualise for presentation, especially to non-discipline people. There’s a really big difference between what scientists want to see and what other people want to see – and if we don’t realise that, then we risk either having presentations that don’t communicate enough information or we kill people with minutiae. Look at this – how useful is it for you?

Death by Sparkline!

Death by Sparkline!

Another good day but I think there are some very full brains!


Humanities Intensive Learning + Teaching, Day 3, Maryland Institute for Technology in the Humanities, #hilt2014

Today was going to be a short day because we had the afternoon off to go and do cultural things. (I took the afternoon to write papers and catch up on work. I tend to work in both timezones when travelling because otherwise work will eat my head.) Today we explored a lot of filtering in Gephi, which was … interesting and best done in practice. Which we did. So, hooray!

We looked at Multimodal Network Projection throughout the day but I’ll come back to that. We started looking at other ways of determining the important and dependant nodes in a network, starting with the HITS algorithm, which identifies Hubs and Authorities in your network.

Then we moved into the wonderful world of PageRank, Citation networks and how all of these things work. PageRank is, fundamentally, how Google works out which pages to give you. You won’t be able to find out the details of the way that Google runs PageRank because gaming PageRank to serve up pages that you probably didn’t want to see is very big business and highly irritating. Search Engine Optimisers (SEOs) spend a lot of time trying to reverse engineer the algorithm and Google spends a lot of time tweaking it. It’s good we’ve solved all the important problems in the world so we can spend time on this.

Back to PageRank. PageRank looks at the number of links going to a node and what the quality of these links is in order to work out which the most important nodes (pages in the case of Google search) are and, hence, which ones you want. In Digital Humanities, you can use this to identify the most significant element of your model – which, by the way, quickly establishes that Hamlet is not as important as you think. Take that, Hamlet! Want more detail on PageRank? Look here.

From Wikipedia: The more you are pointed to by quality links, the bigger (and happier) you are. Hooray!

From Wikipedia: The more you are pointed to by quality links, the bigger (and happier) you are. Hooray!

In Citations, we want to see how is citing which reference, which is straightforward. In Co-Citation networks, we want to measure how often two documents are cited together. There are many reasons for looking at this, but it helps to detect things like cronyism (someone citing a paper because they like someone rather than because the information is useful). As we discussed before, the Matthew Effect comes in quickly, where frequently cited papers get even more frequently cited because they must be good because they’re cited so frequently. (Tadahhh.)

We also looked at a rather complicated area of multimodal projection, which is going to need some set-up. If you have a set of authors and a set of publications, then you can associate authors with publications and vice versa. However, this means that the only way for two authors to be connected is by sharing a publication and similarly for two publications sharing an author. This is a bipartite network and is very common in this kind of modelling. Now, if we make it more complicated, by moving to a conference and having Authors, Papers and Presentation Sessions, we now have a tripartite network and this becomes very hard to visualise.

What we can do is clean up this network to make it easier to represent by hiding some of the complexity in the connections between nodes. Let’s say we want to look at Authors and Presentation Sessions. Then, while the real network is Authors connected to Papers connected to Presentation Sessions, we can hide the Papers with a network link that effectively says “connects this author via a presentation to this session” and suddenly our network looks like it’s only Authors and Sessions. This level of visual de-cluttering, which is dimensional reduction for those playing along at home, makes it easier for us to visually represent the key information and produce network statistics on these simpler graphs. It’s also a natural fit for triple-based representations like the Resource Description Framework (RDF) because the links in the network now map straight to predicates. (Don’t worry if you didn’t get that last bit, some people just got very excited.)

Finally, we looked at how we collect information. Is it stuff we just pick up from the environment (data) or is it something that we choose the way that we collect it (capta)? (Capta comes from the word for capture. Data is passive. Capta is active. Take that, Bembridge Scholars!) If you think about  it, every time you put your data into a spreadsheet, you are imposing a structure upon it, even down to which column is which – it’s not technically data, it’s capta because your interpretation alters it before it even reaches the analysis stage. When it comes to the network that you draw, do you care about the Proximities of elements in your network (location, membership or attitude), the Relations in your network (role, affective, perceptual), the Interactions or the Flows? All of these are going to change what the nodes and edges (links) represent in the network.

The simple rule is that entities are connected by relationships (For those who think in tuples, think “subject, predicate, object” and get your predicate on!) However you do it, you have to pick what’s important in your data, find it, capture it, analyse it and present it in a way that either shows you cool things or supports the cool things that you already know.

A lot to cover today!

After the session, I did some work and then headed off for dinner and board games with some of the other people from the workshop. A nice relaxing night after a rather intense three days.


Humanities Intensive Learning + Teaching, Day 2, Maryland Institute for Technology in the Humanities, #hilt2014

In Day 2, we looked at using the Gephi tool itself, along with the nature of how networks are tied together, looking at network degree and how we could calculate paths through the network. It’s probably important to talk about some of the key concepts of how we measure connectedness in a network, and the relevant importance of nodes.

The degree of a node is the number of links that connect to it. If we care about whether the links have some idea of direction associated with them then we might split this into in-degree and out-degree, being the number of links going in and the number of links going out. What do we mean by direction? Consider Facebook. If you and I are going to be Friends then we both have to agree to be Friends – I can’t be a friend unless you want to a friend to. This is an undirected arrangement and a connection between us implies that both us have an equal connection to each other. Now think about unrequited love: Duckie loves Andie but Andie loves Blane. In this case, the love is directed. Just because Duckie loves Andie, it doesn’t mean that Andie loves Duckie.

Time to put on some Psychedelic Furs and find the alternate ending. (Yeah, yeah, it's not actually in the story and it's creepy.)

Time to put on some Psychedelic Furs and find the alternate ending. (Yeah, yeah, it’s not actually in the story and it’s creepy.)

ALTHOUGH IT SHOULD, JOHN HUGHES!!!

(An amusing aside for network/maths people is that, sadly, love is not transitive or “Pretty in Pink” would have been way ahead of its time.)

One of the other things that we care about in networks is the relative importance of nodes in terms of how many other nodes they are connected to and what this means in terms of the paths we take through the network. When we talk about paths, we usually mean the shortest path, where we start somewhere and go through the minimum number of intermediate points until we get to the destination. We don’t run around in circles. We don’t go one way and then go another. This is important because the paths through a network can quickly identify the important nodes and, therefore, the links between them that are the most travelled thoroughfares.

In this world, we start to worry about the centrality of a node, which indicates how important it is by looking at how many other nodes it is connected to or how many other nodes have to use it to get to other places in the network. This means that we have to talk about betweenness, which measures how many times the shortest paths that traverse a network have to go through a node. By calculating the betweenness of every node, for every path, we can work out which of the elements in our network see the most traffic.

In the case of the Medicis, from yesterday, all roads in Florence lead to the Medicis, the family with the highest betweenness rather than the family with the most money or fame. The Medicis are an extreme case because they occupy their position of importance as the only bridge (broker) between certain families.

If a network is made of highly connected elements and all of the betweenness is the same then no-one has an advantage. If your network can be effectively split into two highly connected groups, with a smaller number of high-betweenness elements linking them, then you are seeing a separation that may mean something significant in your particular network domain. From a power perspective, the high betweenness brokers now have the potential to have much more influence if they charge for transit and transform information that traverses them.

One of the things about creating a network from data is that the network we create may not necessarily model reality in a way that answers the questions we’re interested in, but by looking at the network and trying to work out if it’s got some different structure at the macro, meso and micro scale, then that might give us hints as to how to analyse it, to further develop our understanding of the area that we’re modelling with this network.

I’ve written about the difference between the real world and the model before, but let’s just say that “the map is not the territory” and move on. In terms of the structure of networks, while many people assume that the distribution of nodes and links associated with them would end up in some sort of Normal distribution, the truth is that we tend to see a hub-spoke tendency, where there are lots of nodes with few links and fewer nodes with lots of links. When we start to look at the detailed structure rather than the average structure, we can even talk about uniform the structure is. If a network looks the same across itself, such as the same number of connections between nodes, then it’s what we call assortative. If we have small clusters of highly connected nodes joined to other clusters with sparse links, then we’d thinking of it as disassortative. Suddenly, we are moving beyond some overall statistics to look at what we can really say about the structure of a network.

There’s also a phenomenon known as the Matthew Effect, where links and nodes added to a network tend to connect to better connected nodes and, as the network grows, nodes with more connections just get more connected – just like life.

Apart from a lot of Gephi, we finished the day by looking at paths in more detail, which allow us to talk about the network diameter, the size of the largest shortest path in the network. (Remembering that a shortest path contains no loops or false starts, so the largest shortest path shows you the largest number of unique nodes you can visit in the network and gives you an idea of how much time you can waste in that network. 🙂 )

There are some key concepts here, where having redundant paths in a network allows us to survive parts of a network going away (whether computer or social) and having redundant paths of the same quality allows us to endure loss without seeing a significant change in the shortest paths in the network. These are all concepts that we’re familiar with in real life but, once we start to measure them in our network models, we find out the critical components in our data model. If we’re modelling student forums, is there one person who is brokering all of the communication? What happens if she gets sick or leaves the course? Now we have numbers to discuss this – for more, tune in to the next instalment!

 


Humanities Intensive Learning + Teaching, Day 1, Maryland Institute for Technology in the Humanities, #hilt2014

I’m attending the Humanities Intensive Learning + Teaching courses at Maryland Institute for Technology in the Humanities, #hilt2014, for the second year running. Last year was Matt Jocker’s excellent course on R and this year I’m attending Elijah Meek’s course on Network Analysis and Visualisation. The first day we covered network basics and why you might want to actually carry out visualisation across graphs – and what the hell are graphs anyway?

Graphs, put simply, are a collection of things and connections between those things. Now that I’ve killed every mathematician reading this blog, let’s continue. I’ve done a lot of work on this before in the Internet Topology Zoo but it’s now looking like analysis of large-scale online education is something I have to get good at, so it seemed a great opportunity to come and see how the DH community do this and get access to some mature processes and tools.

Why is this important as a visualisation (or representation, thanks, Elijah) target? Because pictures tell stories well and we can use this to drive argument and make change.

Let’s consider the Medici, the family who dominated Florence from the 1400s to the 18th century. While not being the most wealthy and powerful families at the outset, they were (by marriage and arrangements) one of the most well connected families. In fact, the connections from some groups of families to other families had to go through the Medicis – which made them more important because of their role in the network.

Padget & Ansell’s network of marriages and economic relationships between Florentine families. (from http://www.themacroscope.org/?page_id=308)

Padget & Ansell’s network of marriages and economic relationships between Florentine families. (from http://www.themacroscope.org/?page_id=308)

The graph makes the relationship and the importance clear. (Insert toast about Boston, Lowells and Cabots here.)

In graphs of the Internet, everything is connected to the Internet by definition, so we don’t have any isolated elements. (We do have networks that don’t connect to the Internet, such as super-secret defence networks and some power stations – not as many as there used to be – but we’re interested in the Internet.) It is possible to analyse communities and show ways that some people/entities/organisations are not connected to each other. Sometimes they form disconnected clusters, sometimes they sit by themselves, and this is where my interest comes in, because we can use this to analyse student behaviour as a learning community.

A student who enrols in your course is notionally part of your community but this is an administrative view of the network. It’s only when they take part in any learning and teaching activity that they actually become part of the learning community. Suddenly all of the students in your network can have a range of different types of connection, which is a good start to finding categories to talk about behaviour in large on-line courses, because now we can easily separate enrolment from attendance, attendance from viewing, viewing from participation in discussion, and discussion from submission of work. I hope to have a good look into this to find some nice (convenient) mathematical descriptions of the now defunct cMOOC/xMOOC distinction and how we can encourage behaviour to get students to form robust learning networks.

As we can see from the Medicis, the Medicis used their position in order to gain power – it wasn’t in their interests to form additional connections to make the network resilient if they fell on hard times. However, learning networks don’t want a central point that can fail (central points of failure are to be avoided in most contexts!) and this is why a learning community is so important. If students are connected to many other students and their lecturing staff, then the chances of one relationship (connection) failing causing the whole network to fail is very low. Some people are, naturally, very important in a learning community: we’d hope that instructors would be, tutors would be, and key students who handle discussions or explanations also would be. However, if we have very few of these important people, then everyone else is depending upon this small number to stay connected and this puts a lot of stress on these people and makes it easy for your network to fall apart.

I’ll talk more about this tomorrow and hit you with you some definitions!


Teaching in Hong Kong, Reflections

I used to wait for a while before writing my reflections, to give myself time to sort the information and think about things. These days, one amazing thing finishes and then another starts. I’ve had a lot less free time than I thought I would on this course and I have about a day of downtime before starting work again on Monday (I fly back this afternoon) and then I’m off again on another project a week from then. So I’ll try and capture my thoughts now, although I always hope to come back and adjust them later as I think. (I actually have a maxim when I’m travelling to “do it while I’m there” because I’ve discovered that, too often, the way back takes you on a different path, so I may never revisit this but, knowing this, I’ll try to put in as much as I can.)

In talking to the students, and in seeing their marks, this has been a success but, like all pathfinders, the problem now is marking the trail so that other people can find it. How do we do this again? How do we it better? Well, we’re already talking about setting up the administration earlier to get some things going that we wanted the first time around and I’ve got a list of suggestions from the students as to what they think could be better.

But this was a strong validation of guided study groups, blended learning, flipping, collaborative work and giving students some freedom. This also justified putting the effort into industry visits and sharing what we do in our school with our students. This has been an excellent educational experience, with emphasis on both parts of that. The community has made it work, because the students commented that they’d never spent so much time digging into references to understand what was going on, and that discussion was an important part of that. However, their physical proximity allowed them to do it face-to-face, rather than on a forum.

That’s an interesting point – if you look at our electronic footprint, you’d think the course was a failure. Instead, because we spent at least 3-4 hours together every day, issues were resolved quickly and only broadcast when they had to be. You’d be right in assuming that that can be a little wearing on the instructor, because you’re always on, but that’s why I’m here.

The next trick is going to be getting this to work for less crazy instructors – in the last couple of days, I’ve been really tired, because the constant “on” (while doing my normal job back home in the cracks) is not what I’m used to and I don’t have my usual home comforts to settle back in to, in order to unwind. Not a big complaint, as I’ve been very comfortable, but a lot of people would find it hard to leave their families for three weeks, let alone act as mentor, guide and teacher to 10 people across that time. Having said that, the students have been great and I’ve worried more about them as a general principle than I’ve had to deal with as a reaction to problems. But it is still wearing.

Probably, the most important thing to do next time is to go ahead with the pan of integrating local students as well, to maximise the mix. Travel broadens the mind, because the longer you spend in other cultures, the more of your own you can start to see without your own biases. We’ve done very well in exposing students to different areas of HK and Macua, in seeing other communities and industries, and in meeting people – but this would be so much better if they worked side-by-side with students from CUHK. Apart from a learning experience, we could be building working and friendship relationships measured in decades rather than a trip measured in weeks.

In summary, it went well and we managed to combine engaging learning with effective learning – so I’m very happy to call this a successful start, already.

If you’ve been following the course, thanks for reading, and I’m always happy to answer questions in the comments! Ciao!

Take us out, Bruce!

EMBRACE YOUR AWESOMENESS! Don't make me come over and sing "Blinded by the Light!"

EMBRACE YOUR AWESOMENESS! Don’t make me come over and sing “Blinded by the Light!”


Teaching in Hong Kong, Day 15 (Final Day)

The final day started a little late as we didn’t have as much to do. At 10am we kicked off with a quick end-of-course note from me, thanking the students for their enthusiasm and their participation, and their willingness to go along with the whole scheme. They, in turn, presented me with a very large book about cats (a running theme in the course and my own family contains three cats – where, by contains, I of course mean exists for the service of) which they had all inscribed. It was a delightful way to start the day!

Shally and Louis from the OAL came along and we presented them with gifts to thank them for all of their help and support during the last three weeks: Australian wine and excellent chocolate, in case you’re wondering. They, in turn, presented us with lemon cake, a CUHK dessert speciality, which is a sort of lemony.. frozen.. corn flakey… iced? treat that is deliciously indescribable! Thank you again to everyone at CUHK: Shally, Louis, Phoebe, Daniel and Cora, for all of your help while we’ve been here!

Seconds (and thirds) are had.

Seconds (and thirds) are had.

After a very pleasant 30 minutes, we got back to the final on-campus business, the quiz and the short-answer exam. With that concluded, I was able to inform the students to check their marks to look at their progress and some students will notice that they already have enough raw marks to pass – although they won’t stop now!

That concluded the course, except for a vegetarian dinner at a Buddhist “mock meat” restaurant in Hung Hom, arranged by David, our Alumni Ambassador for Hong Kong and China. The meal was astounding and we got take a double decker bus ride through Hung Hom from the MTR to get there – adding another experience into the mix! Here are some of the photos and dishes from the meal:

All of this is vegetarian but you can't tell from the taste or the texture.

All of this is vegetarian but you can’t tell from the taste or the texture.

Here are two pictures with David serving from the (ample) banquet and the students enjoying themselves:

photo 2 photo 3

(The very small student is David’s daughter – Ashley.) Those of you with sharp eyes might recognise the fruit on the table as Durian – the King of Fruits and the fruit with one of the most amazing reputations. David went out specially to go and get two beautiful durians for the students to try something new (Kelvin and I needed no convincing as we are both strongly committed to the durian) and most people were very pleasantly surprised that the smell wasn’t as bad as they’d heard and that the taste was very much worth pursuing It is, however, amazingly rich, there’s a lot of the avocado richness concealed in the ultra-mango and (being honest) spring onion flavours of the durian.

Here’s the final picture of the course, standing proud in front of the defeated durian. We came to Hong Kong to learn, to experience and to have a lot of fun. And we did all three. Thank you, Hong Kong!

Thumbs up and mug for the camera!

Thumbs up and mug for the camera!

 


Teaching in Hong Kong, Day 14

Thursday the 24th was a very straightforward day as we were bringing most of the coursework to a close and moving into the evaluation phase – not just for the students but for the course itself. We started with a tutorial on security, where we went through a range of issues and looked into the most commonly occurring problems, as recorded by the Computer Emergency Response Team (CERT) webpage. A lot of the problems we get are caused by:

  • Programmers not checking what users type into their programs.
  • Programmers not properly limiting what programs send to each other.
  • “Back door” exploits where hard-coded usernames and passwords are left in.

We had a chance to discuss white hat and black hat hacking – patching a bug will get you a small cheque and a shout out, reporting an exploit to a criminal syndicate can make you rich. (And hunted or dead, for that matter.) It’s an interesting area – ethically and academically.

I’d asked the students, on Tuesday, to pick an area of security to talk about for 5-10 minutes and this proved to be a really interesting activity, taking longer than the hour I’d allotted for it. Topics included the Stuxnet Worm, and you can imagine we got a lot of interesting discussion out of this one, mobile device problems, the RTM Work from 1988, Botnets, Cross-site scripting issues, and the Heartbleed bug. (Compulsory XKCD comic here.)

XKCD 1354 – Heartbleed explained concisely and accurately.

I like asking the students to go out and find things because, by presenting it to each other, we get infectious enthusiasm and we have people applying their knowledge to reinforce what we’ve learned in class.

Then, for the final hour, we sat down and discussed the course itself. The students are still free to put in anonymous reports on things they don’t feel comfortable discussing with me, but I wanted to get a feel for what we should be improving. Overall, the course has been a great success and, teaching-wise, it’s gone pretty well. We have to smooth out some administration issues on our side but we already knew that and we’re starting the planning process for the next time in the next week.

It’s really great for me to get a sense that the students both enjoyed the course and learned from the course. As we always say, engaging isn’t always effective, but in this case it appears that it has been and, so far, the results I’ve seen reflect that we’ve achieved a great deal in a short time.

Only one day left, which is mostly a final talk from me, some “thank you”s and a dinner. Then we all start our flights home.

Only two more updates for this trip to come – the final Friday and my overall reflections. Hang in there, I’m nearly done!


Teaching in Hong Kong, Day 13 – MACAU!

We decided to add some interesting activities on the Wednesdays, to provide a break from the course, and Wednesday the 23rd saw us going to Macau (or Macao), another Special Administrative Region and a former Portuguese colony bordering on China. I picked up the students at the University train station and we headed down to Sheung Wan on Central for the ferry. It turns out that Hong Kong’s MTR system is busy during rush hour:

I'm slightly addicted to Panorama, yes, but wow, look at that station.

I’m slightly addicted to Panorama, yes, but wow, look at that station.

We went on the TurboJet to get a good ride across to Macau, sitting upstairs for sight-seeing and for a better ride. The students quickly got into the correct mode.

 

Why, yes, I would like to discuss that issue with you.

Why, yes, I would like to discuss that issue with you.

No pictures, please.

No pictures, please.

Landing in Macau itself, which is another country after all, we went through immigration and then outside the ferry terminal. where I had to constantly deflect the attentions of the touts who were all trying to get us onto a bus or a tour or something. One of the reasons to go to Macau was to show the students more of the region, the ecosystem of a casino-driven economy like Macau and the ways that cultures integrate differently, even with a similar group mixing in.

I wanted to start the students off at St Pauls, ruins in the heart of Macau island itself, so this required three cabs … and three languages. The first cab required me to dust off my terrible Mandarin (Qing day wo men qu da san ba pai fang?), the second pretended to understand my Sportuguese (Ruinas de Sao Paulo, por favor?), although it turned out that he tried to take the students for a ‘ride’ so they got out and walked the rest of the way, and the last one listened to my awful linguistics and said “St Paul’s” and away we went.

We (finally) all managed to meet up and, boy , was it hot! While being slightly less humid than Hong Kong, the Macau sun is a killer. I gave the team a potted history of Macau, pointed out some things, exhorted them to eat Macanese egg tarts (one of the finest desserts in the world) and then sent them off to explore, asking them to be back at the Ferry Terminal at 4:30 for a 5pm departure.

The group in front of St Paul's.

The group in front of St Paul’s.

The group split up into pairs and fours. Some went off to the Bungee Platform, and had a fantastic time I hear. Others drifted off to look at the glitz and glamour of Macau. All managed to get right into the movement, lane ways, food and egg tarts of a different culture, right on the doorstep of where they’d been living for the past three weeks.

I drifted off to remind myself of some of the more languid Macau experiences, having lunch at a nice Chinese place and taking pictures of the Macanese obsession with Neon. (Hong Kong and Macau take neon to new heights.)

This is a fairly restrained shopfront. You should see it at night!

This is a fairly restrained shopfront. You should see it at night!

I headed back to the ferry terminal early to get some work done (I am actually working on this trip so have to fit marking, planning and course coordination in amongst all of the intensive teaching and events). Come 4:30 we had most of the group back and I put them on to the ferry, while the last two screamed up in a taxi and we got on the boat just in time for a relaxing trip back to Hong Kong. (Pro tip: catching taxis in Macau is harder than it looks but your group leader will probably be understanding if the delay was caused by egg tarts.)

11 of us left for a fun side trip and all of us got back. I called that a success and left the group to make their own way back to the University while I made a side trip of my own up to the Ozone bar at the top of the Ritz-Carlton Hotel in West Kowloon. You can’t go to Hong Kong and not visit the highest bar in the world, now, can you?

The Ozone bar's design was 'organically inspired' but, fortunately, not in a Gigeresque manner.

The Ozone bar’s design was ‘organically inspired’ but, fortunately, not in a Gigeresque manner.

Another great day, finished, and only two days left to go in the course!