Sunday, February 5, 2012

Difference between Web 1.0, 2.0, 3.0

World Wide Web, commonly known as ‘WWW’, was first launched in 1991. But as the technology advanced, newer versions in the form of Web 2.0 and Web3.0 came into existence. The latter are obviously considered more advanced and easy to use as compared to the Web 1.0. But most of us are confused regarding the changes and the evolution that took place. This article aims to clear out the differences through the below mentioned points. Web 1.0 Web 1.0 was the first reiteration. Actually it was called ‘The Internet’ not version 1 or Web1.0. It was generally used before 1999 when experts called it the Read -Only era. The main features of ‘The Internet’ not Web 1.0 were hyper-linking and bookmarking of the web pages. It only consisted of online guestbook and framesets. There was no flow or communication between Consumer and the producer of the information. Also, the emails were sent through the HTML form. The best examples of ‘The Internet’ are static websites which were made during the ‘.com evolution’. Web 2.0 Web 2.0 was first introduced in the market by O’Reilly at the brainstorming discussion at media live International in 1999. The information available through Web 2.0 empowered the new generation to develop new concepts like Wiki, Widgets and Video streaming. It also allowed many users to publish their own content through few basic steps, which was not possible in the Web 1.0 or The Internet. Web 2.0 was responsible for the development of various sites that we commonly use today like Twitter, Flickr and Facebook. Web 3.0 Web 3.0 is known as the third generation of World Wide Web. It has everything that we could ever wish for. With the help of Web 3.0, web content was easily carried in the form of natural language. It also consist of micro formats, natural language search, recommendation agents which are commonly known as AI i.e. Artificial Intelligence. We can also find different attributes like Deductive Reasoning, Contextual Search, Evolution of 3D web, Personalized Search and Tailor made Search which was not present in earlier versions. Through such advancements in Web 3.0, we have become empowered to do many things that we may have never dreamed of. But this not the end, as with the time we will get to see more advancements in World Wide Web that will make internet surfing an amazing experience.

Tuesday, February 22, 2011

some seo questions which one should know

1)How is Page Rank calculated by Google?
2)What is Google Algorithm and how does it work?
3)How many types of Links are there and what are they? Which links are preferred by Google and Which not?
4)What is the benefit of Internal Linking and how does that help?
5)What happens if you use the keyword density more or less then its required and what are the consequences?
6)Whenever you write any content, remember that to use the targetted keyword in the top 50% of the paragraph. It becomes very easy for Google to crawl.
7)What are the benefits and drawbacks of Dynamic URL's?
8)If you have a lot of flash on your website, then what are the other options to do a ood SEO?
9)What is a quality link?
10)What exactly Google indexing is and how does it work?
11)Whenever you target any keyword and if using it 4-5 times in an article..always remeber to bold the keyword once and once in italic..
this sounds and shouts at the Google Spider that "Look at me"
I am something important
12)How does Social Networking websites can benefit us?
13)What is the difference between Crawler / Spider / Robot / Spider
14)What are PPC Campaigns?
and how does it benefits
15)What is the difference between Site Map and sitemap.xml?
16)What is IP and how can you hide your IP if you are getting blocked by any website each and everytime?
17)How can you reduce the Bounce rate of your website and normally what should be the bounce rate?
18)What is 3-Way Link Exchange?

Wednesday, July 1, 2009

robot.txt

Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

It works likes this: a robot wants to vists a Web site URL, say http://www.example.com/welcome.html. Before it does so, it firsts checks for http://www.example.com/robots.txt, and finds:

User-agent: *
Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.

There are two important considerations when using /robots.txt:

* robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers will pay no attention.
* the /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.

How to create a /robots.txt file
Where to put it

The short answer: in the top-level directory of your web server.

The longer answer:

When a robot looks for the "/robots.txt" file for URL, it strips the path component from the URL (everything from the first single slash), and puts "/robots.txt" in its place.

For example, for "http://www.example.com/shop/index.html, it will remove the "/shop/index.html", and replace it with "/robots.txt", and will end up with "http://www.example.com/robots.txt".

So, as a web site owner you need to put it in the right place on your web server for that resulting URL to work. Usually that is the same place where you put your web site's main "index.html" welcome page. Where exactly that is, and how to put the file there, depends on your web server software.

Remember to use all lower case for the filename: "robots.txt", not "Robots.TXT.

Syntax
back to top

The simplest robots.txt file uses two rules:

* User-agent: the robot the following rule applies to
* Disallow: the URL you want to block

These two lines are considered a single entry in the file. You can include as many entries as you want. You can include multiple Disallow lines and multiple user-agents in one entry.

Each section in the robots.txt file is separate and does not build upon previous sections. For example:

User-agent: *
Disallow: /folder1/

User-Agent: Googlebot
Disallow: /folder2/

In this example only the URLs matching /folder2/ would be disallowed for Googlebot.

Blocking user-agents
back to top

The Disallow line lists the pages you want to block. You can list a specific URL or a pattern. The entry should begin with a forward slash (/).

* To block the entire site, use a forward slash.

Disallow: /

* To block a directory and everything in it, follow the directory name with a forward slash.

Disallow: /junk-directory/

* To block a page, list the page.

Disallow: /private_file.html

* To remove a specific image from Google Images, add the following:

User-agent: Googlebot-Image
Disallow: /images/dogs.jpg

* To remove all images on your site from Google Images:

User-agent: Googlebot-Image
Disallow: /

* To block files of a specific file type (for example, .gif), use the following:

User-agent: Googlebot
Disallow: /*.gif$

* To prevent pages on your site from being crawled, while still displaying AdSense ads on those pages, disallow all bots other than Mediapartners-Google. This keeps the pages from appearing in search results, but allows the Mediapartners-Google robot to analyze the pages to determine the ads to show. The Mediapartners-Google robot doesn't share pages with the other Google user-agents. For example:

User-agent: *
Disallow: /

User-agent: Mediapartners-Google
Allow: /

How to create a /robots.txt file
Where to put it

The short answer: in the top-level directory of your web server.

The longer answer:

When a robot looks for the "/robots.txt" file for URL, it strips the path component from the URL (everything from the first single slash), and puts "/robots.txt" in its place.

For example, for "http://www.example.com/shop/index.html, it will remove the "/shop/index.html", and replace it with "/robots.txt", and will end up with "http://www.example.com/robots.txt".

So, as a web site owner you need to put it in the right place on your web server for that resulting URL to work. Usually that is the same place where you put your web site's main "index.html" welcome page. Where exactly that is, and how to put the file there, depends on your web server software.

Remember to use all lower case for the filename: "robots.txt", not "Robots.TXT.