Hi (again), I'm Zack, and this is my blog. Have a look at the most recent posts below, or browse the tag cloud here on the right.

Archives are available as well.

You can leave comments on my posts by following the relevant link associated to each post. Alternatively, you can mail me comments; note that unless otherwise requested, I will add mailed comments in the comment feeds.

The blog is mostly in English with some bits of Italian and French here and there.

debsources debbugs oh

My ongoing quest for lowering the barrier for contributing to Debsources continues.
In this chapter:

  • I've migrated bug reports from the previous ad-hoc text file in the Git repo to the Debian BTS, under the umbrella of the qa.debian.org pseudo-package.
    From now on this is the recommended (and documented) way of reporting bugs against http://sources.debian.net.

    Look ma, it also has one of those newfangled short URLs: http://deb.li/debsrcbugs!

  • While at it, I've also properly tagged the current easy hacks on Debsources using the gift tag. There are definitely opportunities for new contributors there, and there might be more if you submit your own Debsources' pet peeves to the BTS.

    Again, mandatory mnemonic/short URL: http://deb.li/debsrceasy.

What's your excuse for not contributing to Debsources, again?

Posted Thu 11 Sep 2014 07:31:13 PM CEST Tags:

my setup, take #1

Among the various things I've catched up with during the summer, I've finally managed to set aside some time to answer a pending interview request for The [GNU/]Linux Setup: a blog run by Steven Ovadia that collects interviews about how people use GNU/Linux-based desktops.

In the interview I discuss my day to day work-flow, from GNOME Shell to Mutt, from Emacs to Notmuch, and the various glue code tools I've written for integrating them.


Feedback is most welcome.

Posted Wed 03 Sep 2014 10:48:14 AM CEST Tags:

Debsources now has a HACKING file

Here at DebConf14 I have given a few talks. The second one has been a technical talk about recent and future developments on Debsources. Both the talk slides and video are available.

After the talk, various DebConf participants have approached me and started hacking on Debsources, which is awesome! As a result of their work, new shiny features will probably be announced shortly. Stay tuned.

When discussing with new contributors (hi Luciano, Raphael!), though, it quickly became clear that getting started with Debsources hacking wasn't particularly easy. In particular, doing a local deployment for testing purposes might be intimidating, due to the need of having a (partial) source mirror and whatnot. To fix that, I have now written a HACKING file for Debsources, which you can find at top-level in the Git repo.

Happy Debsources hacking!

Posted Sun 31 Aug 2014 10:02:45 PM CEST Tags:

Debsources: Live and Historical Views on Macro-Level Software Evolution

The paper entitled Debsources: Live and Historical Views on Macro-Level Software Evolution, which I've co-authored with Matthieu Caneill, has been accepted at ESEM 2014: the 8th international symposium on Emprical Software Engineering and Measurement.

In the paper we have described Debsources as a software platform for monitoring the evolution of Free Software through the lenses of Debian, and used the main Debsources instance (http://sources.debian.net) to replicate and extend a former study on macro-level software evolution.

Now we "just" have to integrate all the nice charts and data we have extracted for the paper into Debsources' stats page... /o\

Posted Fri 06 Jun 2014 01:22:34 PM CEST Tags:

moar, and moar, and moar debsources stats

A while ago I've announced the availability of several stats about Debian source code on http://sources.debian.net. Since then the statistical basis of those stats has increased a lot, and now includes all Debian historical releases, from hamm (July 1998) onward. This allows to appreciate macro-level evolution trends in Free Software, over a period of more than 15 years, through the eyes of a distro that sits at the nice intersection of the eldest, largest, and most reputed distros.

To get there I've added support for sticky suites to the plumbing layer of debsources, and then injected historical releases from http://archive.debian.org. The injection process took about a week (without any sort of parallelism, pretty slow disks, and computing sha256 checksums, ctags, and sloccount on all source files) and has been an "interesting" experience.

When you go back decades in technology time, bit rot is just around the corner, and I've found my share while injecting archive.d.o into sources.d.n. In both cases the respective maintainers (Guillem and Ganneff, kudos) have been positive about and helpful in improving the situation, despite the low impact of the bugs I've found on the average user. That's quite important for the long-term preservation of digital information in general, and for the perennity of access to Free Software in the specific case of Debian.

While we are it, I'm now maintaining a list of bugs affecting sources.d.n but belonging to other packages, in case you fancy helping out but are not a Python hacker. Interestingly enough, quite a bit of those bugs are related to the fact that tools debsources uses (e.g. ctags, sloccount) are also starting to show their age.

You might wander why buzz, rex, and bo are still missing from sources.d.n. That's in fact for similar reasons. Before hamm Debian didn't have complete archive coverage in terms of Sources indexes and .dsc files. Given that debsources rely on both to extract source packages, it first needs to grow an additional abstraction layer that can cope with their absence. It's SMOP, and planned.

And now let's have fun with ctags bombs.

Yours truly,
Stefano “Indiana” Zacchiroli
(credits: KiBi, #debian-ftp)

Posted Sun 06 Apr 2014 01:19:12 PM CEST Tags:

Debian: watch your stats!

Over the past few weeks, myself and Matthieu Caneill have worked quite a bit on Debsources. As we have now deployed most of the new features on http://sources.debian.net, it's time for another "What's new with Debsources?" blog post. Here is what's new:

  • Debsources now knows about Debian suites, i.e. which package is in which "release" (stable, testing, unstable, ...). This knowledge is already useful for some of the other features below and will be used more in the future.

  • since last summer Debsources has been running sloccount on all unpacked source packages, together with ctags and du, but the resulting information wasn't exposed on the Web. This is now fixed. Each package now has an infobox (example) which shows: disk usage, archive area, suites, and sloccount with per-language breakdown. The new infobox also subsumes the old puny list of package links.

    You can easily embed the infobox in other webapps if you need to (example). Check the URL scheme doc for more info.

  • Debsources now gathers and plot accurate Debian sources statistics, both overall and per-suite, in both snapshot and historical trends flavors.

    (Yeah, I know, the charts are not particularly good looking ATM, but that's easy to change without impacting the rest. So if you're a matplotlib artist and willing to help, please step forward!)

  • many changes have been going on also at the plumbing layer to make the service less resource hungry and more maintainable, in view of a migration to the official Debian infrastructure --- which I've in the meantime started discussing with DSA. Some highlights:

    • Debsources now has a rather comprehensive test suite, built using Nose. Most notably, we do test full update runs down to source unpacking (of a small subset of a Debian mirror), DB injection, and plugin execution --- which is quite neat.

    • the updater is now much faster (about 2x) and might require, in pathological cases, 10x less memory than before. Memory usage now caps at around 300MB, even when injecting ctags for large packages such as linux, chromium, and libreoffice.

    • the DB schema went through several refactoring cycles, and now uses a separate file table to index all known source file paths. In the past path information were duplicated across the checksums and ctags tables, not only wasting DB space, but also making the presence of file information conditional on the enablement of at least one of the two corresponding plugins. This is now fixed --- and migrating the full DB has been quite "fun". Unfortunately, we've also added quite a few large-ish indexes, resulting in no significant overall changes in DB size (currently at ~50GB), but at least in much faster queries :-)

      The next step on this front will be the addition of path-based searches, using the excellent Postgres trigram indexes.

Want more? Sure, we'll be happy to! But it'll happen faster if you help. Speaking of which: we've got Debsources into the new contributors game (see announcement) and we're looking forward to mentor new contributors.

Posted Thu 27 Feb 2014 08:22:00 PM CET Tags:

skyrocketing how-can-i-help popcon count

how-can-i-help by Lucas Nussbaum is one of the best things that happened in the area of attracting contributions to Debian in quite a while. It can be used both as a standalone tool to list opportunities for contributing to Debian which are related to your installed packages, and as an APT hook (which is also the default configuration) that at each upgrade will inform you of new contribution opportunities.

how-can-i-help is great for newbies who are looking for ways to give back to Debian which are a good match for their skills: among other things, how-can-i-help shows bugs tagged "gift" related to packages you use.

how-can-i-help is also great for experienced developers, as it allows them to find out, in a timely manner, that packages they use are in dire need of help: RC bugs, pending removals, adoptions needed, requests for sponsor, etc. (As highly unscientific evidence: I've noticed a rather quick turnover of RFA/O/ITA bugs on packages installed on my machine. I suspect how-can-i-help is somehow responsible for that, due to the fact that it increases awareness of ongoing package issues directly with the people using them.)

So, if you haven't yet, please apt-get install how-can-i-help RIGHT NOW.

I daresay that we should aim at installing how-can-i-help by default on all Debian machines, but that might be an ambitious initial goal. In the meantime I'll settle for making how-can-i-help's popcon count skyrocket. As of today, it looks like this:

how-can-i-help popularity contest graph, 10/02/2014

which is definitely too low for my taste. Please spread the word about how-can-i-help. And let's see what we can collectively do to that graph.

how-can-i-help is just a tiny teeny helper, but I'm convinced it can do wonders in liberating dormant contributions to the Debian Project.

Posted Mon 10 Feb 2014 03:52:08 PM CET Tags:

Qualche giorno fa ho partecipato alla trasmissione radiofonica Caterpillar su Radio 2, per parlare della recente pubblicazione delle linee guida che implementano l'articolo 68 del Codice per l'Amministrazione Digitale e dell'obbligo di preferire il software libero nella pubblica amministrazione (PA) italiana che tale articolo prevede. (Qui potete trovare un post in Italiano sullo stesso argomento.)

Chiacchierare con i Dott. Cirri e Zambotti è stato, come la volta scorsa, piacevole, divertente, e credo informativo per il grande pubblico di Radio 2. Il tutto è stato coronata dalla bella sorpresa che i conduttori mi hanno fatto di invitare, in coda al mio intervento, gli ascoltatori ad intervenire in diretta per segnalare success stories di adozione di software libero nella PA italiana.

A future memoria, rendo disponibile qui il podcast della puntata del 20 Gennaio 2014 (terza parte). Il mio intervento inizia al minuto 9" circa. Buon ascolto!

Posted Mon 27 Jan 2014 10:30:07 AM CET Tags:

Il servizio di podcast non ufficiale di Caterpillar che ho mantenuto per molti anni è per ora sospeso. Ho aggiornato il post corrispondente per segnalare la sospensione.

Il motivo è che drivecast sembra avere rimosso a partire da Gennaio 2014 la possibilità di registrare emittenti radio e generare podcast feed a partire da esse. Se siete a conoscenza di servizi alternativi che posso usare per ripristinare il servizio, vi prego di farmi sapere.

Posted Tue 14 Jan 2014 11:12:24 AM CET Tags:

Over the next few weeks I'll be on the road, attending a few Free Software events and giving talks. In particular:

  • during the upcoming week-end (18-19 January 2014) there will be a mini-DebConf in Paris. I'll be there to give a talk about Debsources (see schedule), participate in the Debian France plenary meeting, and meet Debian friends from all over Europe.

  • two weeks later (1-2 February 2014) I'll be at FOSDEM 2014 to give a retrospective talk about legal issues that Debian has faced over the past few years, participate in a panel about Free Software governance, … and meet Free Software friends from all over the world!

  • the day after FOSDEM end (3 February 2014) I'll be in Antwerp for the opening keynote of the SQM workshop to discuss some of my research work at IRILL and current challenges in the field

See you "there"?

Posted Tue 14 Jan 2014 11:12:24 AM CET Tags: