Google's Flash Indexing Disaster

On July 1st, Google announced that, using technology provided by Adobe, it had enhanced the Google Search Engine to index the text embedded within Flash movies. What followed was bad advice from Google, second-guessing by web developers, and finally a few straight answers.

Google’s initial announcement was so incredibly vague as to render it all but useless. Developers came away knowing that Google was doing something different with their Flash content, but that’s about it.

While Google’s Dion Almaer suggested that search engines have always been black boxes and that it was up to us to discover what had changed through testing, just about everyone else was crying foul.

Google’s credibility was immediately in question due to the obviously bad advice it contained:

"If you prefer Google to ignore your less informative content, such as a "copyright" or "loading" message, consider replacing the text within an image, which will make it effectively invisible to us."

For the record, replacing fast-loading, accessible text content with a bulky image simply to hide it from search engines is never a good idea.

Google’s list of caveats in the announcement were similarly perplexing:

"Googlebot does not execute some types of JavaScript. So if your web page loads a Flash file via JavaScript, Google may not be aware of that Flash file, in which case it will not be indexed."

What types of JavaScript? Established best practice for publishing Flash content is to use the SWFObject JavaScript library to overcome bugs in older browsers, so was Google saying that it would only index Flash content that was authored using broken/outdated HTML-only techniques?

"We currently do not attach content from external resources that are loaded by your Flash files. If your Flash file loads an HTML file, an XML file, another SWF file, etc., Google will separately index that resource, but it will not yet be considered to be part of the content in your Flash file."

Any experienced Flash developer knows that if you are going to have any significant amount of text in your Flash content, your best bet is to stick it in an XML file and load it on the fly, so you don’t have to rebuild your Flash movie whenever you change the content.

Apparently, not only will Google not see Flash content authored this way, but it will track down the XML file anyway and index it as a separate page on your site! That’s right, Google will helpfully direct people searching for your content to the raw XML file that contains it, rather than your slick, Flash front-end.

All this stuff made so little sense, that many developers questioned whether Google was actually able to index any Flash content of consequence. Within a few days, however, the Search Engine War blog was able to verify that Google was indeed indexing Flash content.

Finally, after several days of developer outcry, Google admitted it had left too many questions unanswered, and four days later, it posted a significant update that is well worth reading if you have any Flash content on your site.

Here’s a quick summary of what we now know:

  • The July 1st release didn’t index Flash content inserted with the SWFObject library‘s dynamic publishing method, which writes the Flash content into the page entirely with JavaScript. The recommended static publishing method (where two nested <object> tags are included in the page) was indexed. Google is now deploying an update that supports the dynamic publishing method as well.
  • Text content loaded on-the-fly from an XML file is not yet indexed, but Google is working on fixing this in the near term.
  • Google will do its best to detect when duplicate content is there to provide an HTML alternative to Flash content, and will only display one of the two versions in the search results. No penalty is applied to a site’s search ranking due to duplicate content.

There are still unknowns here, but that will always be the case with the Google search engine. Though it took a few days, Google is answering what questions it can, and responding to developer concerns with enhancements.

Before very long, most of the text within Flash-based web sites will make its way into the Google search index. Nevertheless, uncertainty will remain over how deeply Google is able to probe Flash content for a while yet. Providing non-Flash alternative content will remain an effective means of guaranteeing your most important content a place in the Google index. It also gives users of non-Flash-enabled browsers (like the iPhone) something to look at.

Though Google’s initial message was pretty half-baked, the follow-up has put most of my concerns to rest. How about yours?

Replay

Category: marketing Time: 2008-07-16 Views: 1
Tags:

Related post

  • Google desktop not indexing new emails 2009-09-22

    About a month ago, Google Desktop stopped indexing my outlook emails. The service is still running. I run an update with the Google updater. But there is no change. Any search I run will only return emails going back from about a month ago. What shou

  • How to get Google Desktop to index searchable pdf's 2010-04-10

    I've got a scanner that converts documents to pdfs. The pdf that gets produced is searchable. However when google desktop indexes the file it doesn't appear to be indexing any of the contents of the pdf (although it is indexing the content of other p

  • Google isn't indexing URLs after a redirect that differs only in percent-encoding/decoding? 2010-09-03

    Will Google's crawler refuse to follow redirects when the difference between the redirected-from and redirected-to URLs is solely whether specific characters are percent-encoded or not? For example: www.splunkbase.com/apps/All/4.x/Add-On/app:PDF+Repo

  • Have Google Desktop to index a website without visiting every page manually? 2010-11-04

    Has any enterprising soul figured out how to get Google Desktop to index a website without having to actually visit every page on that site with your browser? or perhaps created an extension which causes one's browser to follow all links unattended?

  • Google is not indexing URLs in my XML SiteMap? 2011-09-14

    I have used multiple sitemap.xml for my product pages and my category pages and problem is google only not index my whole site map's link. /sitemap_main.xml -- URLs submitted:114 // URLs in web index :88 /sitemap_products.xml -- URLs submitted:391 //

  • Why google doesn't index youtube videos in sitemap? 2012-03-19

    I've created a sitemap for youtube hosted videos in my website. you can find it here. http://www.informaincasa.it/wp-content/uploads/sitemap-video.xml I've also created a sitemap for videos hosted on blip.tv The first sitemap while webmasters tool sa

  • Is text in Javascript HMTL scripts taken into account in Google (or other) indexes? 2012-04-19

    I have a HTML page where I update the content of a div to display some introduction text when displaying the page. It is hard for me to create copies of this page for all possibilities and move the text as static content correspondingly. When I have

  • What does Google do with indexed pages returning 403? 2012-05-03

    I guess that Google removes already indexed pages that are now returning the HTTP error 403. Unfortunately I can't find any definitive statement on that matter. Do you know the answer? --------------Solutions------------- As you know Google will even

  • Why is Google still not indexing my !# website? 2012-08-01

    I have been working on a website which uses #! (2minutecv.com), but even after 6 weeks of the site up and running and conforming to the Google hash bang guidelines stated here, you can still see that Google still hasn't indexed the site yet. For exam

  • How can I force Google to re-index my site? 2012-09-22

    I changed the structure of my URLs. The pages are already indexed by Google and have the following structure: /myfolder/page.apsx The new structure is: /page.aspx Now all URLs that Google knows are wrong. How can I tell Google to re-index and that th

  • How to get Google Search to index a Google Group? 2012-10-16

    I created a group three days ago. It is a private group (meaning people need permission to become members) but everyone can ask to join. I specifically marked "List this group in the directory", and yet I can't find my group in a Google search.

  • Google Webmaster not indexing my site 2013-03-05

    I've moved my site from ak.net84.net to aramk.com, and I've redirected the pages from the old to the new using these methods in my htaccess for the old site: all pages redirect to the new site using a rewrite rule, with 301 http code manually specifi

  • Google Webmaster Tools Index Status 0 for one year 2013-04-21

    I've read this topic with answers but they do not address why zero for one year? but other areas in Google Webmaster Tools report 400 URL indexed. Google Webmaster Tools Index Status Only 2 Indexed. Google Webmaster Tools Sitemaps Panel 493 Submitted

  • What to do if Google starts to index fewer pages from your site? 2013-09-07

    How do I find out why Google is now indexing fewer pages from my site? Here is what Google's help says although I don't find it that helpful: A steady increase in the number of crawled and indexed pages indicates that Google can regularly access your

  • Does Google provide different Index status for HTTP and HTTPS? 2014-05-29

    Will Google provide different index statuses for HTTP and HTTPS? Examples: http://www.example.com and https://www.example.com Or it will provide same results for both? If it doesn't provide the same results, can we configure both HTTP and HTTPS in Go

  • Google is re-indexing pages after redirecting URLs from HTTP to HTTPS incorrectly 2014-08-21

    I upgraded my site so that all pages have gone from using HTTP to HTTPS. I didn't consider that Google treats HTTPS pages differently than HTTP. I recreated my sitemap to so that all links now reflect the new HTTPS URLs and let it be for a few days.

  • How can you tell google to stop indexing a resource? 2014-11-03

    We have an app we are migrating. The old website had deeply nested robots.txt files at some point and then the old developers started handling those request with redirects to the root page. So: a request to http:://example.com/foo/bar/robots.txt is n

  • Does Google OCR and index text in the images? 2015-01-13

    A lot of the images that I post are schematics and diagrams. In those, about 30% of the "ink" is in the callouts and labels. I believe, readers comprehend better when the text is on the figure, compared to putting it in the caption under the fig

  • Will providing an internal graph allow the Google bot to index my site faster, and update it more often? 2015-01-20

    A programmer, but a very inexperienced webmaster here. I am building a social site that gets content generated literally every day. On top, the content of my already indexed pages changes quite often (as I am pulling a lot of stuff from Twitter etc).

iOS development

Android development

Python development

JAVA development

Development language

PHP development

Ruby development

search

Front-end development

Database

development tools

Open Platform

Javascript development

.NET development

cloud computing

server

Copyright (C) avrocks.com, All Rights Reserved.

processed in 0.732 (s). 13 q(s)