iEntry 10th Anniversary RSS Newsletter Advertising
Visit Twellow.com

Google Can Now Index . . . Flash! An Interview with Michael Marshall

As you know, a Flash movie as the index page of a site has always been a major problem with search engine optimization. There's simply no content for the search engines to index.

So when I learned that Google can index the contents of Macromedia Flash movies, I was astonished. It seemed this remarkable discovery had gone virtually unnoticed in the SEO community.

But as you probably know, Google has always been the first to index different types of content: PDF files, .doc files, etc. Google has also made amazing inroads in being able to index dynamic content.

And now they're the first major search engine to index Flash another significant step forward in the SEO industry.

So why has Flash presented such problems in the past?

Background of Macromedia Flash Movies and SEO

With a Flash movie as the main page of a site, the Web site owner is giving up the crucial text necessary to prove to the search engines that the main page is about a particular topic. Without that text, the search engines have nothing to index. Therefore, the main page rarely does well in the rankings, unless off-page factors such as link popularity or link reputation are sufficient to carry the page on their own.

In the past, legitimate work arounds have been few and far between. This made things extremely difficult for businesses who wanted to create a rich user experience with a Flash home page, such as Web design firms, photography studios, graphic design firms, and so forth.

So, these businesses often sacrificed rankings for the user experience, since they could rarely have both while still following all of the guidelines set forth by the search engines.

Introducing . . . Michael Marshall

When I learned that Google is indexing Flash from Michael Marshall, creator of ThemeMaster (http://www.theme-master.com) and chat/forum moderator for our online search engine marketing courses (http://www.onlinewebtraining.com), and when I learned of the fascinating discoveries he'd made, I immediately wanted to interview him for an article.

So let's take a look at what Michael has discovered about Google and Flash.

Question:

Michael, how do we know that Google is now indexing the contents of Flash files? Is there a way that we can search the index just for Flash?

Michael Marshall:

Yes. You can enter your search term in Google, and along with that search term, use the filetype operator and restrict your search to the file extension ".swf". This will search for your search term only in Macromedia Flash files. You should see [FLASH] just before each listing in the results page that is a Flash document.

For example, put the following in the search box at Google:

"Best Free Banner Exchange Market" filetype:swf

Question:

How can we extract the text found in a Flash file to see what Google sees?

Michael Marshall:

Macromedia has a Flash Search Engine SDK (http://www.macromedia.com/software/flash/download/search_engine/) that will give us just what we need. The SDK (Software Development Kit) includes an application named 'swf2html'. Swf2html extracts text and links from a Macromedia Flash .SWF file, and returns the data to stdout or as an HTML document. Swf2html is provided as a compiled application and as a static library for linked library implementation. For complete functionality, see the file Readme.htm included in the SDK.

Question:

Do you have an example of a Flash file that we can see, as well as an example of the text that the Macromedia tool extracted from the Flash file?

Michael Marshall:

Yes. I have an example of each. If you look at the extracted output in Web page form, you will see that it is not very pretty. Nevertheless, you've got lots of SEO-worthy content there, and that's what we are most concerned with. You should visit the Flash presentation itself, mouse over the text, and click the links in the presentation so you can be familiar with the Flash presentation. You can compare where certain text appears in the Flash presentation and where it is found in the extracted text.

Example of Flash file:

http://www.internet-marketing-analysts.com/flash_sample.html

Example of extracted text:

http://www.internet-marketing-analysts.com/extracted_text.html

(Note: This Flash example is based on one of Michael's own products. However, I chose to use it for two reasons: 1) because of the many different types of Flash involved; and, 2) because it is a text-heavy Flash example, as opposed to many other examples of Flash that I could have chosen to use.

Added Note: Be sure to highlight the entire page by clicking on Ctrl A)

In the output file, you'll notice that some text seems to be repeated on multiple lines and one portion of it even appears invisible since the font color comes out white. This is just a side effect of the conversion/extraction tool and is not really invisible text and is not spamming in any manner.

In other words, you're doing nothing wrong when this happens it's simply due to the tool itself and not spamming or true invisible text.

(Continued in Part 2. This is a two-part article.)

Robin Nobles conducts live SEO workshops
(http://www.searchengineworkshops.com) in locations across North
America. She also teaches online SEO training
(http://www.onlinewebtraining.com). Localized SEO training is now
being offered through the Search Engine Academy.
(http://www.searchengineacademy.com) Sign up for SEO tips of the
day at mailto:seo-tip@aweber.com.

News Tags: Search, Google, index, Flash, HTTP, Text
About the author:
Robin Nobles conducts live SEO workshops (http://www.searchengineworkshops.com) in locations across North America. She also teaches online SEO training (http://www.onlinewebtraining.com). Localized SEO training is now being offered through the Search Engine Academy. (http://www.searchengineacademy.com) Sign up for SEO tips of the day at mailto:seo-tip@aweber.com.

18 Comments

prefabrik ev

googlecom has always been on top when it comes to various R&D in the search engine world.

as prefabrik

this article is 4 years old - so you are a bit behind the times..

seo

not sure how accurate this info would be in Google

great info

It was a matter of time before this would happen. I am sure spammers will find a way to game this one, too.

good

not sure how accurate this info would be in Google

kadın diyet zayıflama ve oto kiralama r10seoogle

good informations thank you

Google has always been on

Google has always been on top when it comes to various R&D in the search engine world.

Great stuff

Thanks again Robin

Great information

Thanks for the great article and facts Robin

Great article

Some more great information. Thanks Robin.

interesting....

problem is that flash designers can hide more text and such now for SEO...

Great article

We voted against putting flash on the new law website becuase of the size and crawling concern.  I think their might be some room in the future, depending on how Google does at handling it.

Flash Websites is now good on google

This is very interesting. After a few years ago, there was so many SEO Experts discussed that flash is not good for SEO and Now this is a great news for us.

this article is 4 years old

this article is 4 years old - so you are a bit behind the times

WOW

Interesting article.  And now they can start reading it too.

interesting

There are new things coming out now that google can read flash files, just another black hat way for spammers to take advantage of  the algorithm

great article

not sure how accurate this info would be in Google.

Where is part 2?

I can't see the link for part two, some help would be appreciated. Thanks

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Post new comment

The content of this field is kept private and will not be shown publicly.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.
7 + 4 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
Featured Headline
Is Bing Making Google Better?
Competition Breeds Better User Experience
3 comments | 9 hours ago
 
Subscribe to WebProNews


Send me relevant info