The paper Sasha and I have been working on for quite a long time has just been published in the International Journal of Communication: PageOneX: New Approaches to Newspaper Front Page Analysis. We hope it provides a useful guide and resource for the field of newspaper front page analysis.

PageOneX: New Approaches to Newspaper Front Page Analysis
Sasha Costanza-Chock, Pablo Rey-Mazón


PageOneX is a Free/Libre and Open Source Software tool that we designed to aid in the coding, analysis, and visualization of newspaper front pages. Communication scholars have long analyzed newspaper front pages, using column inches as an important indicator of mass media attention. In the past, this involved obtaining physical copies of newspapers, coding and measurement by hand, and manual input of measurements into a spreadsheet or database, followed by calculation, analysis, and visualization. PageOneX automates some of these steps, simplifies others, and makes it possible for teams of investigators to conduct shared newspaper front page analysis online. We review scholarship in this area, describe our approach in depth, provide concrete examples of analysis conducted by media scholars using PageOneX, and discuss future directions for research and development.

Download article (pdf).


Article about column inches and number of headlines in newspapers in 1930.
Quantitative analysis of coverage measured in “column inches” in 1930.

PageOneX will be at the II Encuentro de DatAnalysis15M: Models, analogies, and analysis of the mutations and evolution of the 15M network movement, this Friday, September 19, 2014, at 16.30h in Barcelona.

There will be video streaming. Venue: Sala William J. Mitchell. Edificio MediaTic. Carrer Roc Boronat 117, 7th floor. Barcelona.

I will be presenting the research PageOneX: New Approaches to Newspaper Front Page Analysis. The talk will walk through the history of newspaper and front page analysis, and will show and explore the possibilities of PageOneX as a tool for studying the coverage of social movements.

PageOneX is software that facilitates the coding and analysis of news stories on newspaper front pages.

Full program of the II Encuentro DatAnalysis15M

II Encuentro de DatAnalysis15M: Models, analogies, and analysis of the mutations and evolution of the 15M network movement

Thursday, September 18
16.00h. Introduction.
16.20h. Synchronization and embodiment in 15M. Testing the dynamic cores hypothesis in 15M. Miguel Aguilera, Universidad de Zaragoza.
16.50h. Multilayer models, possible measures, and possible analogies. Emanuele Ecozzo, Universidad de Zaragoza.
17.20h. Questions.
18.00h. Systems neuroscience and technopolitics. Xabier Barandiaran, Universidad del País Vasco.
18.30h. Applying evolutionary models to the analysis of large-scale communication networks. Ignacio Morer, Universidad de Zaragoza.
19.00h. Questions.

Friday, September 19
16.00h. Young people and communication in #15M: use of online/offline tools to get information about #acampadabcn. Ariadna Fernández, Universitat Pompeu Fabra.
16.30h. PageOneX: new approaches to newspaper front page analysis. Pablo Rey Mazón, Montera34.
17.00h. Questions.
18.00h. The emergence of network movements. An empirical and multidisciplinary conversation with social movement theory. Arnau Monterde, Universitat Oberta de Catalunya.
18.30h. Post-15M ecosystem, network movements, and constituent power. Javier Toret, Universitat Oberta de Catalunya.
19.00h. Questions.

More information

  • Dates: September 18 and 19, 2014
  • Time: 16.00 to 20.00
  • Venue: Sala William J. Mitchell. Edificio MediaTic. Carrer Roc Boronat 117, 7th floor. Barcelona
  • Streaming:
  • Twitter: #Data2
  • Registration: admission is open, but you need to confirm attendance by sending an email to

Now you can embed a PageOneX thread in your site:

You can check this thread at PageOneX > Corrupción España. Octubre 2013.

To get the embedded version, just add “/embed” to the end of the thread URL
and embed that web page with an iframe.
This version of the display view is the same as the normal thread, but without the header (navigation bar, description, vertical legend) and footer.
You will have to play with the scale and height of the div and iframe to make it look good.

Code to embed (replace THREAD_URL with the address of your thread):
<div id="wrap" style="width: 100%; height: 490px; padding: 0; overflow: hidden;">
<iframe src="THREAD_URL/embed" style="width: 1280px; height: 700px; -ms-zoom: 0.68;
-moz-transform: scale(0.68); -moz-transform-origin: 0 0; -o-transform: scale(0.68);
-o-transform-origin: 0 0; -webkit-transform: scale(0.68); -webkit-transform-origin: 0 0;
transform: scale(0.68); transform-origin: 0 0;
max-width: none; overflow: hidden; border: 1px solid white;"></iframe>
</div>
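
The trial and error with scale and height can be reduced with a little arithmetic: the wrapper only needs to be about as tall as the iframe height multiplied by the scale. The following is a minimal sketch of that idea, not part of PageOneX. The function name `embed_snippet`, its defaults, and the example URL are assumptions made for illustration.

```python
from html import escape


def embed_snippet(thread_url: str, scale: float = 0.68,
                  frame_width: int = 1280, frame_height: int = 700) -> str:
    """Build an HTML snippet that embeds the '/embed' view of a thread.

    Hypothetical helper: thread_url is the address of the normal thread page.
    """
    src = thread_url.rstrip("/") + "/embed"
    # The scaled iframe occupies frame_height * scale pixels vertically,
    # so the wrapper div can be cropped to that height.
    wrap_height = round(frame_height * scale)
    iframe_style = (
        f"width: {frame_width}px; height: {frame_height}px; "
        f"-webkit-transform: scale({scale}); -webkit-transform-origin: 0 0; "
        f"transform: scale({scale}); transform-origin: 0 0; "
        "max-width: none; overflow: hidden; border: 1px solid white;"
    )
    return (
        f'<div style="width: 100%; height: {wrap_height}px; '
        'padding: 0; overflow: hidden;">\n'
        f'<iframe src="{escape(src, quote=True)}" style="{iframe_style}"></iframe>\n'
        "</div>"
    )


# Placeholder URL, used only to show the output shape.
snippet = embed_snippet("https://example.org/threads/my-thread")
print(snippet)
```

With the values used in this post (700px at a scale of 0.68) the computed wrapper height is 476px. The 490px in the snippet above leaves a small margin on top of that.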

Another example: coverage of the Gezi Park protests in Turkish newspapers last June.

This post is the newsletter that is sent to all the subscribers and people on the PageOneX users’ list.

Here is a compilation of everything related to PageOneX that has happened in the past weeks. From now on, I hope to do this more often.

Featured threads

There have been some good threads on PageOneX recently. Here are some of them:

Protesters or Vandals? How the traditional media approached Brazilian protests in June 2013 by Débora Leal.
Thread available at PageOneX: Violence x Protest

How did 3 Colombian newspapers cover the agricultural strike? My analysis with @pageonex (in Spanish) by @Amelrame.
Based on the thread in PageOneX: Análisis Paro Agrario.

La crisis de Pescanova en la prensa gallega
The collapse of Pescanova in Galician newspapers
by @mcshuibhne. He analyses how local newspapers in Galicia, a region of Spain, covered the collapse of the food company, which had a debt of €3.6bn. I found the way he used the tool interesting: he highlighted in red the absence of coverage of the topic, and then classified the stories by type of heading (importance by h1, h2, h3). The percentage of surface is not useful in this case.
Based on the thread in PageOneX: Pescanova.

#occupyGezi coverage in Turkish and international newspapers vs Twitter. Check the live threads at PageOneX: OccupyGezi in International newspapers and OccupyGezi in Turkish newspapers. Coded with @matrushka, @bilgenkurt and @numeroteca.


I’ve started a month-by-month series of threads on Spanish corruption coverage. You can check all the threads under the colorcorrupción category or in the dedicated Tumblr. Besides the daily analysis, I’ve started a monthly review (September 2013) with more graphics, using the export data feature in PageOneX. The latest available thread is Corruption in Spain in September 2013.

If you want to see your thread featured here, just comment on this post. There is also a gallery of PageOneX cases on Pinterest.

PageOneX development

We changed the license of the code to GNU AGPL v3, which better suits a web app like PageOneX, following a suggestion from a user involved with the GNU project.

Some folks are trying to set up their own PageOneX deployment on Heroku or elsewhere. We have added an installation guide to the wiki, along with instructions for Heroku. Thanks to David Cabo for the advice.

There have been a lot of small improvements to make the tool more robust. Check the commits! Together with Edward L. Platt and Rahul Bhargava, from the Center for Civic Media, we keep fixing bugs.

Coverage of PageOneX: covering the coverage

There was an interesting article/interview about PageOneX in a Spanish online newspaper: Analyzing newspaper discourse with free software (Spanish) by @PauLlop.

PageOneX was featured in Euskadi Innova, a public website for innovation in the Basque Country.

There was a post on PageOneX for the 6th congress on Research on Information and Communication in Madrid, Spain.

Thanks everyone for using PageOneX. It helps us to make it better!

Ads vs. News in The New York Times
Thursday, June 20th 2013 edition of The New York Times.

Inspired by the film Manufacturing Consent, we wanted to measure the ratio of ads to news in a print edition of The New York Times.

Results? Approximately 2:1 news to ads. For every 2 pages of news, there is one page of advertising.

Related projects for understanding newspapers:
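
To make the arithmetic behind that ratio explicit, here is a minimal sketch. The page counts are hypothetical, chosen only to reproduce a 2:1 ratio (we did not record exact counts in this post), and the function name is made up for the example.

```python
from fractions import Fraction


def news_to_ads(news_pages: int, ad_pages: int) -> Fraction:
    """Return the news-to-ads ratio as a reduced fraction."""
    if ad_pages <= 0:
        raise ValueError("ad_pages must be positive")
    return Fraction(news_pages, ad_pages)


# Hypothetical counts, NOT measurements from the June 20th 2013 edition.
news_pages = 40
ad_pages = 20

ratio = news_to_ads(news_pages, ad_pages)
ad_share = Fraction(ad_pages, news_pages + ad_pages)

print(f"news:ads = {ratio.numerator}:{ratio.denominator}")  # news:ads = 2:1
print(f"ads share of the paper = {float(ad_share):.0%}")    # ads share of the paper = 33%
```

In other words, a 2:1 ratio of news to ads means that roughly one third of the printed surface is advertising.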
Guns, Wars and Terrorism in a real size PageOneX (photo essay)
Anatomy of a front page

How to do this?

  1. Buy two copies of the same edition of a newspaper. You need two copies to be able to display both sides of every page. We used the exterior side of the pages from one copy and the interior side from the other.
  2. To be safe, we marked (with a thin line) the side of each sheet that we were not going to use, so that we would never end up with a piece of paper without knowing which side to use.
  3. Cut and separate Ads and News.
  4. Once you have the two piles, Ads and News, build the bar charts. Keep them straight and give them the same width. To make the puzzle easier, we put all the full-size (uncut) pages together at the bottom of the bars.

Anybody up for doing other newspapers?

Some pictures of the process:

Made by Rogelio López @Tochtli_exe and Pablo Rey Mazón @numeroteca.

Front page images in the display view are displayed differently depending on whether they have been coded. This makes it easier for the user to tell at a glance which images have been coded (more transparent) and which have not (more opaque).

I’ve been testing different opacities: making both uncoded and coded images lose opacity, and making the color codes less transparent. Related commit.

Check the tests:

Coded: 0.65
Uncoded: 1


Coded: 0.3
Uncoded: 1


Coded: 0.15
Uncoded: 1
Opacity of color codes: from 0.5 to 0.6

and the final chosen selection:


Coded: 0.15
Uncoded: 0.6
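
To get a feel for what these numbers do on screen, here is a minimal sketch of how an opacity value blends a pixel with the background. It assumes a plain white page background and simple linear blending, which is an approximation of what the browser does. The function name is made up for the example and is not PageOneX code.

```python
def blend_over_white(gray: int, opacity: float) -> int:
    """Blend a gray pixel (0 = black, 255 = white) over a white background."""
    if not 0.0 <= opacity <= 1.0:
        raise ValueError("opacity must be between 0 and 1")
    return round(opacity * gray + (1.0 - opacity) * 255)


# Final values chosen in this post.
CODED_OPACITY = 0.15
UNCODED_OPACITY = 0.6

# How dark does black newspaper text (gray level 0) end up in each case?
coded_text = blend_over_white(0, CODED_OPACITY)
uncoded_text = blend_over_white(0, UNCODED_OPACITY)
print("coded:", coded_text)      # coded: 217
print("uncoded:", uncoded_text)  # uncoded: 102
```

Under those assumptions, black text on a coded front page fades to a very light gray (217 out of 255), while on an uncoded page it stays a mid gray (102), so the two states are easy to tell apart.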