Downtime: Database Hard Disk Failure
-
- Game Server Admin and Web Monkey
- Posts: 313
- Joined: Thu Dec 29, 2005 1:46 pm
- Location: Michigan
- Contact:
Downtime: Database Hard Disk Failure
Our database has experienced a Primary Master Hard Disk Failure and is currently down for diagnosis and repair.
We take daily backups of the database, so at worst it may get rolled back by 24 hours.
This down time may be extensive as the hard drive may need to be replaced (we have a spare or two to switch to).
During this downtime of the database, the servers will be down as well as they generally cannot function properly without it.
We take daily backups of the database, so at worst it may get rolled back by 24 hours.
This down time may be extensive as the hard drive may need to be replaced (we have a spare or two to switch to).
During this downtime of the database, the servers will be down as well as they generally cannot function properly without it.
World Leader: Tairis'nàdur
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
-
- Game Server Admin and Web Monkey
- Posts: 313
- Joined: Thu Dec 29, 2005 1:46 pm
- Location: Michigan
- Contact:
After some shuffling of hard disks, writing a new MBR on a hard disk that was previously a secondary slave and is now a primary master, Lazarus is back up.
The database was completely safe from the hard disk failure. The servers are back up.
The database was completely safe from the hard disk failure. The servers are back up.
Last edited by Themicles on Tue Jun 10, 2008 5:10 pm, edited 1 time in total.
World Leader: Tairis'nàdur
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
-
- Squire: Church of Pants
- Posts: 711
- Joined: Sun Dec 17, 2006 10:30 pm
- Contact:
+1
*encourages teh clicky to DONATE:*
portal.php
edit: I've replaced the long page widening link with a link to the front page where donation links for both the hosting of the game servers, and the website hosting reside. - Themicles
*encourages teh clicky to DONATE:*
portal.php
edit: I've replaced the long page widening link with a link to the front page where donation links for both the hosting of the game servers, and the website hosting reside. - Themicles
Last edited by Rali'vinee on Tue Jun 10, 2008 6:39 pm, edited 1 time in total.
-
- Honor Guard: Church of Pants
- Posts: 1484
- Joined: Sun Mar 05, 2006 5:08 pm
- Location: Richmond, VA USA
- Contact:
Themi and the Hala Team all Rock!
You guys are the best.
To my fellow players....
I think we can find a few gold coins to toss Themi's way.
Thanks and love as always,
Tremayne
You guys are the best.
To my fellow players....
I think we can find a few gold coins to toss Themi's way.
Thanks and love as always,
Tremayne
Second Star to the Right and Straight on 'til Morning
"If life is a hankerchief, love is the embrodery that makes it more beautiful." - Alexis Dufresne Montjoie
"A Tyrite, a thief, a ranger and a preppy elf were sitting in a bar with a druidess..." -Aranel
"If life is a hankerchief, love is the embrodery that makes it more beautiful." - Alexis Dufresne Montjoie
"A Tyrite, a thief, a ranger and a preppy elf were sitting in a bar with a druidess..." -Aranel
-
- Game Server Admin and Web Monkey
- Posts: 313
- Joined: Thu Dec 29, 2005 1:46 pm
- Location: Michigan
- Contact:
I was about ready to pass back out when I made my last post... just realized the first sentence is incomplete. Fixing it now. 
EDIT:
And now I'm editing this post too...
Some more details: Apparently Lazarus' power cord was mistaken for another one, and Lazarus ended up not plugged into the UPS, while something else that didn't need backup power was. This morning our power started flicking on and off rather rapidly and we woke to the sound of computers turning off and back on along with the warning beeps from the UPS telling us it was switching to backup power.
Many people know that rapidly powering up and then powering down a PC can really trash hard disks. Most cases, you may just end up with a corrupt Master Boot Record, or with Windows 2000 and higher you may end up with a corrupt hard disk controller driver (hint, run a "Recovery Install" from your Windows install disc. DO NOT format! Do not start a fresh install).
This killed the Primary Master hard disk that had the Master Boot Record for starting the computer. It was a Maxtor 30 GB hard disk that came out in 2003. Right around the time Maxtor hard drive longevity started going down hill. Luckily, the Linux OS install along with the database were on another hard drive. Ironically, it was the oldest hard drive in the computer at 13 GBs and a production date in 1999! It still passes health tests even.
I had to write a new MBR to the 13 GB disk so that the system would start again, and rearranged all of the drives with the 30 GB one now missing. This meant I also had to rearrange some config files in Linux, but it's all good now.
The 30 GB hard drive only had a Win2k AS install and some files backed up to it that mostly had copies elsewhere on the network anyway.
For anyone concerned, we do have daily complete database backups that are saved to a whole other computer on the network with much newer and (hopefully) more reliable hardware. Lazarus is next on the block to be rebuilt, and I'll be going with a redundant RAID setup for backups (though the database will be on a single disk to keep write performance up).

EDIT:
And now I'm editing this post too...
Some more details: Apparently Lazarus' power cord was mistaken for another one, and Lazarus ended up not plugged into the UPS, while something else that didn't need backup power was. This morning our power started flicking on and off rather rapidly and we woke to the sound of computers turning off and back on along with the warning beeps from the UPS telling us it was switching to backup power.
Many people know that rapidly powering up and then powering down a PC can really trash hard disks. Most cases, you may just end up with a corrupt Master Boot Record, or with Windows 2000 and higher you may end up with a corrupt hard disk controller driver (hint, run a "Recovery Install" from your Windows install disc. DO NOT format! Do not start a fresh install).
This killed the Primary Master hard disk that had the Master Boot Record for starting the computer. It was a Maxtor 30 GB hard disk that came out in 2003. Right around the time Maxtor hard drive longevity started going down hill. Luckily, the Linux OS install along with the database were on another hard drive. Ironically, it was the oldest hard drive in the computer at 13 GBs and a production date in 1999! It still passes health tests even.

I had to write a new MBR to the 13 GB disk so that the system would start again, and rearranged all of the drives with the 30 GB one now missing. This meant I also had to rearrange some config files in Linux, but it's all good now.
The 30 GB hard drive only had a Win2k AS install and some files backed up to it that mostly had copies elsewhere on the network anyway.
For anyone concerned, we do have daily complete database backups that are saved to a whole other computer on the network with much newer and (hopefully) more reliable hardware. Lazarus is next on the block to be rebuilt, and I'll be going with a redundant RAID setup for backups (though the database will be on a single disk to keep write performance up).
World Leader: Tairis'nàdur
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
-
- Squire of the Holy Church of Annoyance
- Posts: 68
- Joined: Thu Jul 26, 2007 1:12 am
-
- Game Server Admin and Web Monkey
- Posts: 313
- Joined: Thu Dec 29, 2005 1:46 pm
- Location: Michigan
- Contact:
Too true, but not all "Redundant Array of Inexpensive Drives" configurations have any sort of data security (redundancy). Raid 0 for instance. ;Pwhirlin_merlin wrote:A points if anyone can figure out why this phrase is amusing and how many points you are getting.Themicles wrote:redundant RAID
World Leader: Tairis'nàdur
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
Administrator: Confederation of Planes and Planets
Server Host & Web Monkey: Hala The Heroic Domain of Ysgard
-
- Squire of the Holy Church of Annoyance
- Posts: 68
- Joined: Thu Jul 26, 2007 1:12 am
Right, but with the acronym being what it is the phrase is still redundant.Themicles wrote:Too true, but not all "Redundant Array of Inexpensive Drives" configurations have any sort of data security (redundancy). Raid 0 for instance. ;Pwhirlin_merlin wrote:A points if anyone can figure out why this phrase is amusing and how many points you are getting.Themicles wrote:redundant RAID

Indeed not. RAID is sometimes said to stand for Redundant Array of Independent Disks.Sable wrote:neither are they always inexpensive![]()
-
- Wearer of the Holy Pants
- Posts: 2645
- Joined: Tue Mar 27, 2007 2:42 pm
- Location: Out and about!
*looks at all the pretty words and is glad there are people here who understand them*
I'm also glad I was unable to log in anyway for RL reasons when the servers were down; so painless; please always arrange things that way!
I'm also glad I was unable to log in anyway for RL reasons when the servers were down; so painless; please always arrange things that way!

13thHour : [Tell] *your alignment has long since passed any possible further move to 'sexy' due to reinventing the scale*
[url=http://wiki.ysgard.org/index.php?title=PCs:Lexy]Lexy on the Wiki![/url]
[url=http://wiki.ysgard.org/index.php?title=PCs:Lexy]Lexy on the Wiki![/url]
Done!Rali'vinee wrote:*encourages teh clicky to DONATE:*
portal.php
