Availability is easier to manage than uptime, because uptime has to do with one computer, which might fail, whereas availability has to do with the service, which might be spread across many computers, networks, and power grids.

High Availability vs. High Uptime - Quebec


Achieving high uptime is a worthy goal. To that end, Transparen tends to purchase high-end server hardware that lets us do things like insert new hot-plug ServeRAID cards (3L or 4L, these days) and initialize new 10-20 SCSI disk enclosures without shutting down the servers or restarting the operating systems. It is also why Transparen cares about a consistent Internet connection and highly available hydro-electric power for its servers. Despite all of these things, if uptime is the only factor managed, it is difficult to maintain more than 99% uptime under normal circumstances, and when failures occur, they can take a long time to resolve (sometimes days, not just hours or minutes).

Factors Resulting in High Uptime

Naturally, we believe in following industry best practices when it comes to maintaining high uptime. These include:

  • Using good hardware
  • Ensuring that electric power is highly available
  • Using Uninterruptible Power Supplies (UPS) to prevent short power interruptions from limiting uptime
  • Having redundant parts in the servers, including extra hard drives (RAID configurations), extra network cards, extra power supplies, etc.
  • Practising a conservative approach to software changes - making high-risk changes only when absolutely necessary, while taking regular actions to ensure that stability improvements are implemented promptly.

Despite Best Practices, Good Hardware, and Ideal Environment, Server Uptime is Limited By Single Points of Failure

Despite all these practices, a solitary machine, even with redundant parts, may still fail, because not all of its parts are redundant, and there are still things that can happen that will limit uptime. For instance:

  • More often than one might realize, a localized power outage may occur - one that does not affect the whole building, but does affect the server. The most common examples are a tripped breaker or an unplugged power cord.
  • A network connection may be severed. This could happen in many ways - the simplest is that the Ethernet cable works slightly loose and becomes unreliable, either at the router or at the server. But there are other ways, including router failures, fried Ethernet cards, Internet provider problems, etc.
  • The RAID array may collapse. Even though RAID provides hard drive redundancy, there are still parts of the RAID array that can fail and take the whole thing down. These include:
    • The RAID card (or SCSI/SATA/IDE card, if implementing a software RAID)
    • The backplane (i.e. what all the drives plug into)
    • The cabling between the RAID card and the backplane
    • Catastrophic hard drive failures (i.e. multiple hard drive failures beyond the redundancy provided by the RAID configuration)
  • Memory might be defective - server memory is usually provisioned with error correction codes (ECC), which typically correct single-bit errors, but multi-bit errors or an outright module failure can still take the machine down.
  • Power supplies might go out of commission and require replacement. If the power supply redundancy is not sufficient, then the machine may need to be shut down, although it may be possible to replace parts without necessitating shutdown.

In other words, there are too many points of failure, and therefore the odds are stacked against keeping a single server up for years and years.
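
The way these points of failure compound can be sketched numerically. The following is a minimal illustration, assuming the server is up only when every listed part is up and that the parts fail independently. The per-part availability figures are hypothetical placeholders chosen for the example, not measurements from Transparen's servers.

```python
from math import prod


def series_availability(part_availabilities):
    """Availability of a machine that is up only when every part is up.

    Assumes the parts fail independently, so the fractions multiply.
    """
    return prod(part_availabilities)


def downtime_days_per_year(availability, days_per_year=365):
    """Convert an availability fraction into expected days of downtime per year."""
    return (1.0 - availability) * days_per_year


# Hypothetical per-part availabilities (illustrative only).
parts = {
    "power feed": 0.999,
    "network link": 0.998,
    "RAID card, backplane and cabling": 0.999,
    "memory": 0.9995,
    "power supplies": 0.999,
}

server_availability = series_availability(parts.values())
print(f"single server availability: {server_availability:.4f}")
print(f"expected downtime: {downtime_days_per_year(server_availability):.1f} days/year")
```

However good each part is, the product is always lower than the weakest part in the chain, which is why adding redundancy inside one machine only goes so far.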

Availability is Not Limited By Uptime

Individual servers may need to be taken down from time to time, whether voluntarily for maintenance or involuntarily because of a failure, but this does not mean that the 'system' cannot remain available. During such times, the goal is to allow the system to continue to operate, though perhaps not as powerfully as when all servers are up. Ideally, if a server goes down, the system should run a tiny bit slower than normal, but keep running. This way, services can be provided continuously, despite the hardware problems that occasionally arise.

The benefit is that availability is compromised only when all nodes fail. If each node has a 1% chance of being down on a particular day, and the nodes fail independently of one another, then the chance that the whole system will go down on that day is (1%)^n + x, where n is the number of nodes, and x is the chance that the clustering solution is configured wrong or has some bug. With 3 nodes, the chance of a catastrophic failure on a given day is then 0.0001% + x (since 0.01 × 0.01 × 0.01 = 0.000001), where, due to the nature of the software written for high availability, and the people who are interested in configuring it, x is a very small number.
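
The formula can be evaluated directly. This is a small sketch, assuming node failures are independent. The function name and the default of x = 0 are illustrative choices rather than anything taken from a particular clustering product.

```python
def all_nodes_down_probability(p_node_down, nodes, x=0.0):
    """Chance that every node is down on the same day, plus x.

    p_node_down: chance that any one node is down on a given day
    nodes:       number of nodes in the cluster (n in the text)
    x:           chance of a clustering misconfiguration or bug
    Assumes node failures are independent of one another.
    """
    return p_node_down ** nodes + x


# Print the first term of the formula for one, two and three nodes.
for n in (1, 2, 3):
    p = all_nodes_down_probability(0.01, n)
    print(f"{n} node(s): {p:.6%} + x")
```

Each added node multiplies the first term by another factor of 0.01, so in practice x soon becomes the dominant term.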

Many Single Points of Failure Eliminated

By employing redundant servers configured for high availability, we can eliminate several points of failure:

  • Multiple Internet providers can be used, so if one fails, the other may still work
  • Multiple locations - if power goes down in one place, a server in another place is likely to still have power and an Internet connection, and be able to take over as a primary server.
  • Multiple servers - if a server (or node) becomes dysfunctional, others stand ready to take its place
  • Multiple DNS servers on different IP addresses - if one goes down, the others take over. Raw DNS can be used to provide a kind of load balancing - each time a web browser looks up a web server address, it receives a list of IP addresses (in random order). The web browser tries the servers one by one until one works. In the event of a downed server, the user would experience a slowdown, but not a service disruption.
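
The client-side behaviour described in the last point, trying each listed address until one answers, can be sketched as follows. This is an illustrative approximation rather than how any particular web browser is implemented. The helper names are hypothetical, and the `connect` parameter exists only so the logic can be exercised without a network.

```python
import socket


def resolve_all(host, port):
    """Return every address that name resolution lists for host."""
    infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    return [info[4][0] for info in infos]


def connect_first_working(addresses, port, timeout=3.0,
                          connect=socket.create_connection):
    """Try each address in turn and return (address, connection) for the
    first one that answers. A downed server costs a delay, not an outage."""
    last_error = None
    for address in addresses:
        try:
            return address, connect((address, port), timeout=timeout)
        except OSError as error:
            last_error = error  # this server is unreachable; try the next
    raise ConnectionError(f"no address answered: {last_error}")
```

A caller might combine the two as `connect_first_working(resolve_all(host, 80), 80)`. The timeout bounds how long a downed server delays the user before the next address is tried.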

High Availability is Not An Excuse to Not Do Backups

Just because the system is engineered never to go down does not mean that the system administrators can rest assured it never will. However unlikely a catastrophic failure is, it is a matter of when, not if. And due to the complexity of the system, when the failure occurs, some pretty damned good backups will be needed to effect a timely restoration.

© 2003-2007 Transparen Corp.
