Publishr

The new face of the publishing industry

Overlapping Blueprints: A multi-layered approach to metadata and SEO

In a recent webinar organized by OCLC, Laura Dawson presented a very simple, yet essential definition of metadata: “Metadata is data about data.” In this equation, the latter ‘data’ refers specifically to the content of a pBook or eBook. The former ‘data’ is a collection of certain pieces of information about the content that we use to categorize, place, shelve, describe, identify, locate, etc. 

Thankfully, there was no limit imposed on how much of this former ‘data’ there could be. And, while many publishers agree on set pieces of metadata, albeit in a de-facto manner, this need not be the case. Expanding on Laura’s definition, let’s look at some examples of items that could be used effectively within the metadata mix, creating a system of layered metadata from within. 

In-line code: Code within the XML structure of content is going to be a very important place to lace strategy into bookselling. The more online discoverability maintains and expands its role in sales of book products, the more publishers are going to have to get smart about the search game. Some publishers have begun to do this already, others are exploring their options, but ultimately, it will become essential for publishers to include advanced SEO techniques in both their content itself as well as the coded wrapper it resides in. From finding hiding places to secret keywords, to tagging content properly and effectively, to refining titles and subtitles outwardly, to changing how things are done on copyright pages and TOCs, in-line code for both pBooks and eBooks is perhaps the new marketing plan. If you are not doing this, start now.  

Descritive copy: While our conception of metadata has expanded to be included in what we consider ‘marketing,’ the reverse movement has happened less so. Descriptive copy, being actual marketing copy, which feeds out to online accounts, via ONIX or other delivery method, is indeed part of a book’s metadata proper. It describes the content in a longer form than tags and codes, provides an inticing snippet to the consumer, and represents the whole of the product. This copy, like in-line metadata, needs to be optimized for search, as it will appear many places across the internet driving up the book’s collective page rank. Descriptive copy, while having a duty of its own, also needs to play a role behind the scenes in improving discoverability through search. The more keywords a publisher can integrate and the more search-friendly a publisher can make this copy the greater the benefitw will be for our products in the long run. 

Website SEO: Any search optimization performed on web properties related to each book should be related and integrated with search optimization perfomed on the content itself. With the ability to load keywords into XML codes, and, eventually, into the text itself, a strong SEO strategy needs to move beyond the content that we have, beyond the descriptive copy, and onto the streets, so to speak. At the very least, the web team that is setting up websites, landing pages, author blogs, and eCommerce pages must converse with teams that are performing keyword development for editorial and marketing purposes.

Social media: Social media outlets are wonderful for interacting with customers and peers alike, but despite the dissemination of huge amounts of information online, few have begun talking about SEO as it realtes to social media. It is important to note that because social media outlets allow for us to talk about our books, they are additional areas for dessimination of metadata. On the various platforms used, user profiles are highly trafficked, bios are read by many, tweets, syndicated blog posts, messages, and answers to questions are spit out by the million every day. It’s time to get smart about these missives and start loading them with keywords, pushing our SEO strategy to the next level. If every book had, say, 50 keywords that must be included in as many places as possible, social media marketing teams would be hardpressed not to use them thus drawing more attention to a book’s presence. It will not be long before we start using ‘heavy tweets’ in our daily marketing activities. 

Paid Search/SEM: If you are engaging in SEM as well as SEO, it remains important to integrate any paid campaigns with the non-paid ones. Good paid search marketers can tell you exactly what people are typing into Google and how many of them are clicking on a link to your books. This information can and should be used by the rest of the team to develop keywords. Here, it is essential to remember that the conversation is two-way. Making sure all of these layers sync up is more important, I believe, than simply performing the tasks of implementing each. To simply make insular teams responsible for each of these items is to miss the point. There is more metadata than we think, beyond the ISBN, and the tags, and the copy, and the pub date, and the format, and the page count, and the price. There is a whole world of metadata out there that we can exploit. The nature of the internet today is that there is no ‘one-stop shop’. There is only multiplicity. 

There is one last piece of metadata that I wanted to mention, something that someone perhaps much smarter than I will figure out: 

User-gen: Perhaps the most difficult area to navigate, because we have little to no control, user generated content, like customer reviews, is also part of a product’s metadata. It is data about the data. Simply because we do not create it, nor what much control over it’s distribution, does not make it any less effective in describing, identifying, and categorizing our content. In fact, even the smallest items can be of important. Let’s say a certain user is highly active at leaving review on a certain site. Other users who may become familiar with these people may, eventually, begin looking for or even SEARCHING for specific user names in order to follow reviews from a trusted source. With sites like Shelfari, GoodReads, and other sharing sites and options being built into eBooks, user generated content and metadata is going to become perhaps more voluminous that publisher generated metadata. How, in the future, are we going to be able to leverage this data?

Mountains of Data by P. Bradley Robb

Touch a word and gain real-time, global context - that’s the promise of the connected, next-generation eBook. That promise highlights major difference between print and digital books. The advance isn’t in the file format, but rather the level of access, the malleability of a story, the interconnectedness of everything. And the means to achieve that future isn’t through transmedia, or the iPad, or even a radically new file format. No, the key to that future is metadata. Mountains of metadata. 

Metadata, data about data, was a mainstay of publishing long before the term was invented. In a purely theoretical sense, the first story ever composed created the first metadata. Since then, humanity’s been amassing a web of connected stories and a means to sort and classify the. Things like title, author, and publication date are perfect examples of current metadata - these disparate pieces of information don’t necessarily add anything to the story, but they help identify where a book belongs in the greater context of storytelling.

As publishing evolved into a proper industry, the metadata associated with a book likewise evolved into a standardized and efficient system. By the twentieth century, sales catalogs had created a largely standardized system of metadata broken down into a half-dozen or so broad categories. The twentieth century system of metadata is perfect for classifying books in groups, but rather lacking for comparing and expanding on specific books. 

The real hangup of twentieth century metadata is that it operates in a polarized fashion. Certain metadata, like title, author, or ISBN, is really specific and grants little insight into a piece of writing. Other pieces of metadata, like publisher or BISAC Subject Headings, are broad but often gloss over the uniqueness of a particular work. 

The future of metadata, the connected and exceptionally granular variety, holds the hope of finding broad connections through specific fields. The method to that madness involves metadata derived from the inside of a story – the minutia that can be easily overlooked as pieces of a whole. When those pieces are extracted and connected, the Law of Truly Large Numbers (or as we’ve taken to calling it in marketing circles Chris Anderson’s “The Long Tail”) takes over. 

Moving from theory into potential, imagine reading a young adult book where the protagonist is a twelve year-old girl. Under the current system of metadata, finding another book where the lead character is another twelve year-old girl involves a great deal of digging – you can narrow the field with the BISAC, but finding a characters specific age is going to require some elbow grease and a great deal of digging. However, with interior, specific metadata, a few taps can bring up an assortment of similar books. 

Interior, specific metadata also allows for comparison and contrast reading. Imagine, if you would, a modern book on the Spanish-American War. Using our future metadata system, finding a book on the same subject written during the war is an easy task. Fully leveraging the system could combine the two books – side-by-side – showing the differences. 

Like the James Arnold Ross character in Upton Sinclair’s Oil! and want to find other books about oil prospectors, or prospectors in general? If the characters occupation is included in the metadata, you’re just a tap away.

And interior, specific metadata can expand beyond the realm of storytelling, and into the greater information sphere. Imagine a self-guided tour of Europe, following in the footsteps of Hemingway’s The Sun Also Rises. Or tracing Jack London’s Call of the Wild. Or fact checking a former politician’s memoir. 

Interior, specific metadata opens a book up in ways that books have never before experienced. Whether a book is made more topical through interconnectedness or merely exposes a reader to immensely scalable cross-sales, the upshot of specific connected metadata is clear: metadata makes the contents of books more important. Metadata increases the value of eBooks by making them more than books. Metadata can turn an eBook into a discovery portal, a cornerstone for information tangents.

The adoption of interior, specific metadata hinges on one very large step – getting metadata right. As pointed out by a slew of forward-thinkers (Mike Cane, Kassia Krozser, Laura Dawson) though the current state of eBook metadata has been standardized in ONIX, the actual quality of that data has been somewhat lacking. The shift towards interior, specific metadata is an exponential increase over not only the current state of metadata, but also the size of an eBook.

Let me be blunt – in the future, books will be more metadata than content. 
Twitter’s recently launched Annotations program allows 512 bytes of data to be attached to a tweet (a 140 byte packet of information) and will expand to 2048 (2 kB) in the near future. This relationship – a roughly 14:1 metadata-data ratio – might sound a bit extreme where eBooks are considered, but a 3:1 or greater ratio is not out of the question.

Crafting that metadata is going to be the great challenge, the question then becomes, who will do the work?

As the engines that turn manuscripts into published books, publishers have a powerful opportunity to turn metadata generation into another value add. Tacking interior, specific  metadata onto editing and career fostering can help traditional publishers remain as the purveyors of legitimate story. It won’t be glamorous, nor will it be easy, but moving an arguably analog manuscript toward its digital destiny is a herculean task that begs for an established bureaucratic system. 

The other path is to crowdsource internal, specific metadata generation. Leaning on the long tail, a book with even limited appeal made available to those with interest is going to garner support from those willing to bash out proper metadata markup. This system of fan supported creation has been widely leveraged by things from fan subtitling of Japanese animation to the creation and upkeep of Wikipedia. The system is neither perfect, nor certain, but it is cheap. 

As we move further into the future of publishing, the generation of internal, specific metadata will become more important. As evident in the iPad, eBooks as a medium are not competing with other eBooks, but instead with other mediums – audio, interactive, and video – for the attention of the audience. Of the five major sources for infotainment – news, film, music, print and video games – the print storytelling media is primed to move into the first truly digital realm. Unlike film, music, and video games, print storytelling properties are easily compared between each other thanks to a standardized method of delivery – words. And unlike the news media, print storytelling isn’t focused on the immediate, but rather on the lasting. 

All we need to do is get metadata right. It won’t be easy, but metadata-enabled eBooks will be the true shift from analog to digital, and represent the greatest change in publishing since Gutenberg.  

P. Bradley Robb is a writer who currently blogs at Fiction Matters

Rethinking the Marketing Mix: Metadata as marketing priority by Laura Dawson

During this past ISBN Hour on Twitter (every Friday at noon Eastern), I was discussing the use of metadata as SEO. Along with Andy Weissberg at Bowker, I’d just given a presentation about this subject at Digital Book World’s Digitize Your Career editorial and marketing forum, and I was digesting what we’d discussed for those who couldn’t be there.

And one publisher asked me if I felt that a good metadata strategy was better than a straightforward PR campaign for any given book.

My response, at the time, was that it isn’t an either/or situation. You need both. But that got me thinking about the various factors in book marketing – what’s effective and necessary to get your books noticed.

1. A good book. Marketing can solve a lot of things. It cannot solve the problem of a book that is not useful or entertaining. The best books are, of course, both. But even one or the other is sufficient to establish a marketing campaign around.

2. Good metadata. Metadata describes the book. Most book shopping is done online these days – even if people are buying in a physical store, they are doing their research online. Metadata – title, author, ISBN, price, publisher, pub date, description, BISAC codes – is like a big neon sign directing consumers to the book. Without good metadata, you can have the best book in the world and nobody can find out about it. Metadata generates keywords. Those keywords are what your customers are typing into Google. Without good metadata, you’re leaving customers without what they want. It is the ultimate SEO strategy.

3. Word of mouth. This is the least tangible factor – the one that’s the hardest to generate and yet the most important. Creating a word-of-mouth campaign requires that you drop the hard-sell, and appeal to individuals. Is this cost-effective? Well…yes. In a phrase: ENGAGE THE INDIES. The best word-of-mouth comes from independent booksellers. Their livelihoods are on the line. They care passionately about books. If your book is good (see #1), they will sell the hell out of it.

4. Social media. This is word of mouth in another form. There are over 400 million people on Facebook. There are over 105 million people on Twitter. We don’t have any statistics for how many people read blogs, but Technorati indexes over 100 million blogs. The Internet is how people talk to one another when they are not face-to-face. It is how people find out about books. What gets tweeted, blogged about, and shared on Facebook gets indexed in Google. To opt out of social media marketing is to silence yourself and your message. And of course, underpinning social media marketing is…? A good metadata strategy so you can point buyers to where they can purchase the book. So you can use metadata for SEO. So you can track consumers’ responses.

5. Conventional PR campaigns. Yes, people still watch TV. They still read newspapers (albeit online – see #4). They pass a Barnes & Noble with a sign in the window that the author is coming to town. Terry Gross and Don Imus still book authors. These things are – still, even now – important. But without the other four firmly in place, this fifth strategy isn’t going to mean much.

The role of the book marketer is somewhat limited – she cannot control the quality of the book she’s given to market, obviously. She cannot control what booksellers think of the book – however entertaining and/or useful she may find it herself. She cannot control how virally the book will take off on the Internet. Two things she CAN control: how accurate and robust the metadata is, and how much she’s going to spend on a traditional PR campaign.

In the past, publishers sold books to bookstores, and people bought them and read them – and that was the conversation. With an exponential increase in the way people talk to one another, publishers are not dictating the conversation anymore. The marketing process has become democratized. The publisher is only in control of what she owns – the quality of the book she’s acquired, the quality of the metadata around the book, and the amount of money she’s going to spend on advertising. But just because you are not in control of the conversation, doesn’t mean you shouldn’t enter it. Nobody likes being shouted at. Engage your booksellers. Engage your readers. Talk with them, not at them. And back up your conversation with what you can control.

Laura Dawson is a 20+ year veteran of the book industry, specializing in its technology issues. She has worked at Doubleday, Muze, Barnes & Noble.com, SirsiDynix, and as an independent consultant whose clients have included McGraw-Hill, Chuckwalla, Scholastic, Capstone Press, Cengage, Alibris, Barnes & Noble, Ingram, Baker & Taylor, and Bowker.

She lives in Brooklyn with her two daughters, and a hamster named Batman.

Publishd