- JoNova - http://joannenova.com.au -

On those recent site troubles….

Posted By Joanne Nova On June 19, 2012 @ 5:18 pm In Global Warming | Comments Disabled

Sorry the site was down twice yesterday. We are still figuring out what happened. It’s probably innocent, but we won’t know for sure until we’ve gone through the logs, which are monstrous.

I suspect that restoring all 100k comments (lost to data corruption after the transfer of my blog) pushed the traffic over the limit, as the many automated spider bots and whatnot recrawled the site and all 824 pages and 100,000 comments had to be recached at once. The log files of the activity on my site over the last few days are huge, so it will take some time to sort out what happened. I gather the servers may have thought they were under attack and shut down.

When trouble strikes the site, check out my Twitter account for info. Don’t forget to follow @JoanneNova.

So why did I move the site in May?

Last year, in June, we did get a denial of service attack. Between that and the increasing costs, we decided we needed to move the site to cheaper, more secure US servers. That move, and all the traffic, went very smoothly, but cost more than $6000 over the year. And even on the cheaper servers, the ongoing bandwidth was still costing $300 per month, so when a dedicated skeptic, who also managed other WordPress blogs as a business, offered to help out at very reduced costs, I could not help but say “Yes Please”.

Hence the site was moved again to another server in the US last month. Because the site is so large, it is not surprising that there have been a few hiccups in the weeks since we switched it over. We should be fine, though there may be more hiccups in the coming days. This new server location will be a lot cheaper to operate in the long run.

If I were funded by Exxon we wouldn’t have quite so many dramas in the switch, because we would be paying commercial rates, which I hear would be around $10,000 per annum.  ;-)

I have no doubt the new web manager is doing an excellent job, but obviously he has paid work to attend to, and is packing this large new role into his spare time in between.

Apologies for the inconvenience. I trust readers will understand, and if we do turn up anything untoward or unexpected about the “Suspended” notice, I’ll post an update here. For the moment, assume it was one of those things that happen to large complexes of software and databases.



Here’s a brief synopsis of the events which took place in relation to this website – David T. (web thingy guy).

TL;DR: not a government conspiracy.

Jo contacted me about the enormous costs of keeping her website up and I related that I purchase hosting bandwidth at wholesale prices out of the US. So, we took the site down for a couple of days and moved servers. Everything seemed to be OK. Unfortunately, due to unforeseen technical issues unrelated to the new website hosting solution, we lost a whole bunch of data about a week after the move. This issue has since been resolved.

We ended up losing almost a week’s worth of data since the previous backup. That data has been recovered but has yet to be integrated back into the website. The server went down yesterday, probably as a result of being re-crawled by Google, though I’m still analyzing the log files to confirm. And seeing as the logs grow by over a megabyte every day, the analysis is slow going. For those who are not technically inclined, here’s a short explanation.

Basically, this website is dynamically generated out of a database. Upon request, the generated page is saved as a cached file. This is so the CPU and database don’t have to do any additional work to render the page on every subsequent request. A page only has to be regenerated when someone posts a comment on it. When Google visits your website, it requests as many pages as it can find. With our comments restored, the whole cache needed to be updated, so the CPU got hung up dynamically generating every page across the website from the database rather than serving each one from a cache file.
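For the technically curious, the caching idea described above can be sketched in a few lines of Python. This is only a toy illustration and not the site's actual caching setup: the `PageCache` class, the `fake_render` function and the counters are all made up for the example. The one number taken from the post is the 824 pages.

```python
class PageCache:
    """Toy page cache: render once, then serve the saved copy until it goes stale."""

    def __init__(self, render):
        self.render = render      # expensive function: page_id -> HTML (stands in for the database work)
        self.cache = {}           # page_id -> cached HTML
        self.render_count = 0     # how many times the expensive work was done

    def get(self, page_id):
        if page_id not in self.cache:
            self.render_count += 1
            self.cache[page_id] = self.render(page_id)
        return self.cache[page_id]

    def post_comment(self, page_id):
        # A new comment changes the page, so its cached copy is thrown away.
        self.cache.pop(page_id, None)

    def invalidate_all(self):
        # Roughly what restoring every comment at once amounts to.
        self.cache.clear()


def fake_render(page_id):
    # Hypothetical stand-in for building a page out of the database.
    return f"<html>page {page_id}</html>"


cache = PageCache(fake_render)
for _ in range(3):
    cache.get(1)                       # three visits, only the first one renders
renders_warm = cache.render_count

cache.invalidate_all()
for page_id in range(1, 825):          # a crawler asking for all 824 pages
    cache.get(page_id)
renders_after_crawl = cache.render_count
```

With a warm cache, repeat visits cost almost nothing. Once the cache is emptied, a crawler that asks for every page forces every page to be rebuilt in one burst, which is the kind of load described above.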

So, as a result, my technical support in the US disabled this website as the CPUs were hanging to the point of almost crashing the server. We stepped through a number of website upgrades and efficiency measures during that afternoon until I was satisfied the website would be OK. I checked the website before going to bed and all seemed fine…

At 2:30am we got another hit on the CPU. What I love about my web host providers in the US is their insane paranoia when it comes to security. This CPU hang triggered an automated script which disables the web account until action can be taken to rectify the problem. The script exists to shut down a denial of service attack so that every other account running off the server doesn’t lose its website and, more importantly, its email. First thing this morning I opened my inbox and discovered Jo’s site had gone down. Within an hour we put a resolution in place and arranged an action plan should the same event happen again. Short of writing my own bot and crawling the website, I have no way of knowing whether another crawl will take the site down. But there is now a plan in place to gather data at a network level (beneath the server) to identify and implement a permanent fix should our current efforts be inadequate.
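As a rough illustration of that kind of automated safeguard, here is a minimal Python sketch. It is not the host's actual script, which I have not seen: the function name `should_suspend`, the 95% threshold and the "three bad samples in a row" rule are all assumptions chosen for the example.

```python
def should_suspend(cpu_samples, threshold=0.95, max_consecutive=3):
    """Return True once `max_consecutive` samples in a row exceed `threshold`.

    cpu_samples: CPU load readings between 0.0 and 1.0, oldest first.
    The threshold and streak length are illustrative assumptions.
    """
    streak = 0
    for load in cpu_samples:
        # Count consecutive overloaded readings; any normal reading resets the count.
        streak = streak + 1 if load > threshold else 0
        if streak >= max_consecutive:
            return True
    return False


brief_spike = [0.40, 0.99, 0.50, 0.99, 0.99, 0.20]
sustained_hang = [0.40, 0.99, 0.99, 0.99, 0.99]

suspend_on_spike = should_suspend(brief_spike)
suspend_on_hang = should_suspend(sustained_hang)
```

Requiring several overloaded readings in a row is one simple way to avoid suspending an account over a momentary spike while still reacting quickly to a sustained hang.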

The state of the website at the moment can best be described as functional. We have recovered most of the lost comments and are working to splice those that are still missing into the database. It has been a complex process of weaving together database-building scripts and webpage scrapers, then running them against flat database dump files to create execution scripts that can be run on the live server. This is all to marry three separate datasets together, all with conflicting references and placeholders. Not a simple issue.
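To give a flavour of why this is fiddly, here is a small Python sketch of merging comment sets whose ID numbers clash. It is a simplified illustration under assumed conditions, not the scripts actually used: the record layout (`id`, `parent`, `author`, `date`, `text`), the sample data and the rule "same author, date and text means the same comment" are all hypothetical.

```python
def merge_comment_sets(*datasets):
    """Merge comment lists with clashing ids, removing duplicates by content.

    Each comment is a dict with keys: id, parent (an id or None), author, date, text.
    Returns a new list with fresh ids and parent references remapped to them.
    """
    merged = []
    seen = {}                          # (author, date, text) -> new id
    for dataset in datasets:
        id_map = {}                    # this dataset's old id -> new id
        fresh = []                     # (new comment, old parent id) added from this dataset
        for c in dataset:
            key = (c["author"], c["date"], c["text"])
            if key not in seen:
                new = {"id": len(merged) + 1, "parent": None,
                       "author": c["author"], "date": c["date"], "text": c["text"]}
                merged.append(new)
                seen[key] = new["id"]
                fresh.append((new, c["parent"]))
            id_map[c["id"]] = seen[key]
        # Second pass: a reply may appear before its parent, so remap parents last.
        for new, old_parent in fresh:
            if old_parent is not None:
                new["parent"] = id_map.get(old_parent)
    return merged


# Hypothetical sample data: note that id 2 means different comments in different sets.
backup = [
    {"id": 1, "parent": None, "author": "ann", "date": "2012-06-01", "text": "First"},
    {"id": 2, "parent": 1, "author": "bob", "date": "2012-06-02", "text": "Reply"},
]
scraped = [
    {"id": 7, "parent": None, "author": "ann", "date": "2012-06-01", "text": "First"},
    {"id": 9, "parent": 7, "author": "cat", "date": "2012-06-10", "text": "Late reply"},
]
recovered = [
    {"id": 2, "parent": None, "author": "dan", "date": "2012-06-11", "text": "New thread"},
]

merged = merge_comment_sets(backup, scraped, recovered)
```

In this toy case the duplicate "First" comment is kept once, and the late reply that pointed at ID 7 in the scraped set ends up pointing at the surviving copy. The real job is harder because the references and placeholders do not line up this neatly.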

I understand that 12 hours can be an eternity in internet time, so whenever any issues arise I do my best to resolve them as soon as possible. I appreciate your understanding in this matter and hope this answers any questions you may have concerning the various issues related to this site.


Article printed from JoNova: http://joannenova.com.au

URL to article: http://joannenova.com.au/2012/06/on-those-recent-site-troubles/

Copyright © 2008 JoNova. All rights reserved.