Common Maintenance Tasks (Workstations and Servers)

Revision for “Common Maintenance Tasks (Workstations and Servers)” created on August 26, 2019 @ 09:25:15

TitleContentExcerpt
Common Maintenance Tasks (Workstations and Servers)
The following items should be completed to maintain the health of your workstation or server. For compute clusters, please see <a title="Common Maintenance Tasks for Clusters" href="http://www.microway.com/knowledge-center-articles/common-maintenance-tasks-clusters/">Common Maintenance Tasks (Clusters)</a>.
<h2>Backup non-replaceable data</h2>
Remember that RAID is not a replacement for backups. If your system is stolen, hacked or started on fire, your data will be gone forever. Automate this task or you will forget.
<ul>
<li>For many groups, a weekly or monthly cron job is fine. Write a script calling <code>rsync</code> or <code>tar</code> which writes the files to a separate server, NAS or SAN. Place the script in <code>/etc/cron.weekly/</code> or <code>/etc/cron.monthly/</code></li>
<li>Users with more complex requirements should look at <a title="Amanda Open Source Backup Software" href="https://www.zmanda.com/download-amanda.php" target="_blank" rel="noopener noreferrer">AMANDA</a> or <a href="http://blog.bacula.org/" target="_blank" rel="noopener noreferrer">Bacula</a></li>
<li>Tape backup systems are still available for those who prefer them. <a title="Contact Microway" href="http://www.microway.com/contact/" target="_blank" rel="noopener noreferrer">Contact us</a>.</li>
</ul>
<h2>Verify the health of the drive arrays (RAIDs)</h2>
Drive sectors can go bad silently. Scheduling regular verifies will weed out any issues before they occur. Automate them or you will forget.
<ul>
<li>Linux Software RAID (mdadm) arrays can be easily kicked into verify mode. Many distributions (Red Hat, CentOS, Ubuntu) come with their own utilities. To manually start a verify, run this line for each RAID (as root):
<code>echo check &gt; /sys/block/md#/md/sync_action</code>
Watch the text file <code>/proc/mdstat</code> and the output of <code>dmesg</code> to watch the status of each verify.
</li>
<li>Hardware RAID controllers provide their own methods for automated verifies and alert notification. Reference the controller’s manual.</li>
</ul>
<h2>Monitor system alarms and system health</h2>
<ul>
<li><em>Preferred</em>: learn how to use the IPMI capability of your system for remote monitoring and management. You’ll spend a lot less time trekking to the datacenter.</li>
<li><em>Alternative</em>: listen for system alarms and check for warning LEDs.</li>
</ul>
<strong>Don’t ignore alarms! If you put it off, you’ll soon find that something else is wrong and the system needs major repair.</strong>



Old New Date Created Author Actions
August 26, 2019 @ 09:25:15 Brett Newman
April 2, 2018 @ 08:35:00 Brett Newman
April 2, 2018 @ 08:34:49 [Autosave] Brett Newman
April 24, 2014 @ 15:41:50 Eliot Eshelman
April 21, 2014 @ 09:23:58 Eliot Eshelman
April 21, 2014 @ 09:23:43 [Autosave] Eliot Eshelman
July 21, 2013 @ 14:52:26 Eliot Eshelman
July 21, 2013 @ 14:50:06 Eliot Eshelman
July 21, 2013 @ 10:17:05 Eliot Eshelman
July 17, 2013 @ 23:33:57 Eliot Eshelman
July 17, 2013 @ 23:33:15 Eliot Eshelman
July 17, 2013 @ 23:32:28 Eliot Eshelman
July 17, 2013 @ 23:28:38 Eliot Eshelman
July 17, 2013 @ 22:58:28 Eliot Eshelman
July 17, 2013 @ 22:57:46 Eliot Eshelman

Comments are closed.