Tea
Storage? I am Storage!
I'm trying to cut down on the bandwidth my site sucks. Er ... sorry .. Tannin's site. I don't want to take bits of the site down, and it would be easy enough to just buy more bandwidth, but why should I? Hey, it's a free service I provide, and I'm already paying for 6GB a month, that ought to be enough. Well, it is enough, most months, but only just barely.
I could just take the risk that the site will run over and suck itself dry, and go off-line for a few days at the end of each month, except that there are five other sites I administer, all using the same shared server, and it would take the other five sites down too, which is not acceptable. (The other five only use about one-quarter as much bandwidth, they are not the problem child.) Also, it would probably kill my email, if that matters.
Last month it got slashdotted again and I had to switch the images folder off-line and have the pages generate blank-space 404s for most of the pictures for a couple of days. (The slashdotting gave me one day with about three times the usual amount of traffic. Yes, that's how close to the limit the site runs. Lucky it was a 30-day month!)
So then I got busy cutting down on wastage.
Looking at the server stats, the pageviews split up into three groups:
It's probably fair to make the following generalisations: anyone who has bothered to bookmark the site probably wants to read it, so they are "good" visitors, and very welcome. People following static links, in the main, the same applies to. The search engine crowd, however, are different. Here, some study of the server logs soon tells us, we have a very mixed bag. Roughly, they fall into the following groups:
(continued on next post)
I could just take the risk that the site will run over and suck itself dry, and go off-line for a few days at the end of each month, except that there are five other sites I administer, all using the same shared server, and it would take the other five sites down too, which is not acceptable. (The other five only use about one-quarter as much bandwidth, they are not the problem child.) Also, it would probably kill my email, if that matters.
Last month it got slashdotted again and I had to switch the images folder off-line and have the pages generate blank-space 404s for most of the pictures for a couple of days. (The slashdotting gave me one day with about three times the usual amount of traffic. Yes, that's how close to the limit the site runs. Lucky it was a 30-day month!)
So then I got busy cutting down on wastage.
- I've switched off hotlinking (more or less - it's possible to get around, but would be too much trouble for most people, which is close enough for my purposes).
- I've asked Tannin to split up one of the three main sections (the CPUs one) into smaller files, so that the average random visitor who only visits one page gets half as much text and half as many jpegs. (The interested reader only has to click one link to see the next page as well, so this does not impact on usability.)
- I've moved all the images around to new locations (which breaks existing links to them). (I also moved most of the text, as part of the general reorganisation, but there are redirects for the text file moves, so a good deal of that will be user-transparent.)
- I've put an appropriate robots.txt into the new image folders, so that search engines won't index the images.
Looking at the server stats, the pageviews split up into three groups:
- Direct address or bookmark: 40 - 45%
- Search engines: 40 - 40%
- Static links: 10 - 20%
It's probably fair to make the following generalisations: anyone who has bothered to bookmark the site probably wants to read it, so they are "good" visitors, and very welcome. People following static links, in the main, the same applies to. The search engine crowd, however, are different. Here, some study of the server logs soon tells us, we have a very mixed bag. Roughly, they fall into the following groups:
- People looking for information about particular things, like a certain old motherboard model. These are "good" visitors, in the sense that they are likely to be interested in the site content, and the people Tannin says he wants to visit. (I dunno why, they are probably all horrible geeks like him.)
- People looking for a specific bit of informaton about a specific item: motherboard manuals, hard drive jumper settings, and so on. These are "bad" visitors, insofar as they are (a) not likely to be interested in the site, and (b) unlikely to get any benefit from it anyway - it doesn't try to provide that level of detail.
- People looking for stupid specific things. I mean, what do you make of search terms like "jumpers to set on Intel chipset motherboard to overclock Celeron 366". Some of them are really dumb!
- People looking for vage but probably not very intelligent things: search terms like "best hard drive" or "differences between Pentium 4 and Athlon Thunderbird 1333", or just "CPU" seem unlikely to bring the site interested readers, or bring the searcher useful content - but you'd be amazed how many people search for really, really vague stuff.
- Image searches. It seems likely that 90% + of image searches are just a prelude to copyright violation and/or bandwidth theft.
(continued on next post)