New pages

From UGCS
  • 14:39, 25 September 2012: Hackathon Points [1,282 bytes], Jdhutchin@ugcs.caltech.edu (Created page with "These are things that should be done around UGCS and their appropriate point value. They are split in two dimensions: * Requires sysadmin / does not require sysadmin * Multiple ...")
  • 21:08, 14 February 2012: Feb14 2012 AFS problems [1,108 bytes], Jdhutchin@ugcs.caltech.edu (Created page with "On the morning of Feb 14, 2012, UGCS had a large AFS outage. ==Timeline== All times in PST. * 7:45: alexr notices AFS problems, reboots apollo, athena. * 8:20: jdhutchin starts...")
  • 06:41, 12 September 2011: Alerting [2,584 bytes], Jdhutchin@ugcs.caltech.edu (Created page with "We have a variety of automated alerting at UGCS to let us know when things are breaking or already broken. ==Notes on alerts== Some alerts are critical, so it is nice if they go...")
  • 20:38, 14 November 2010: Nov 20 maintenance [516 bytes], Jdhutchin@ugcs.caltech.edu (New page: ==Todo== * Reboot coreservers * Reboot shellservers * Reboot kabta/enlil * Reset jdhutchin's password on enlil * Check proper failover with hera/zeus * Check that AFS still functions witho...)
  • 06:26, 14 October 2010: Remote System Management [1,402 bytes], Jdhutchin@ugcs.caltech.edu (New page: Remote System Management (rsm) is a set of remctl scripts that let a remote user run some common administrative tasks on a remote machine. It is primarily designed to let automated system...)
  • 03:44, 18 August 2010: AFS Standards [897 bytes], Jdhutchin@ugcs.caltech.edu (New page: These standards will help AFS failover work well and ensure our stability even if we lose an AFS server. * Every volume that has an RO replica must have one on the same host as the RW vol...)
  • 01:22, 27 July 2010: Smokeping [1,659 bytes], Adr@ugcs.caltech.edu (New page: Smokeping is a network latency monitor powered on RRDTool. It collects data about network latency for various services and reports it through a convenient, graph-powered website. ==Confi...)
  • 11:26, 11 July 2010: XMPP [2,730 bytes], Adr@ugcs.caltech.edu (New page: We want XMPP, so now we have ejabberd running in alpha on Hephaestus. Service will be up and down as various configurations are tried. *Username: <ugcs username> *Domain: hephaestus....)
  • 01:52, 10 July 2010: Backup tapes [432 bytes], Raymondj@ugcs.caltech.edu (adding serial number data on tapes)
  • 00:35, 10 July 2010: 2009-2010 Budget [903 bytes], Adr@ugcs.caltech.edu (New page: At beginning of summer 2010 we have spent $3500 on the new juniper switch and various network cables. We have $1500 left to spend, and everything needs to be purchased by July 31. Any id...)
  • 09:28, 19 June 2010: PGP keyserver [354 bytes], Adr@ugcs.caltech.edu (New page: We have a public SKS OpenPGP keyserver. http://pgp.ugcs.caltech.edu It is configured through cfengine, relevant files in /afs/ugcs/ugcs-admin/cfengine/hosts/sks/, /afs/ugcs/ugcs-admi...)
  • 07:44, 6 June 2010: HTTP Servers [564 bytes], Jdhutchin@ugcs.caltech.edu (New page: ==Goals== The goals behind our web setup are to: * Maintain high uptime for static pages * Maintain high database uptime for dynamic pages that need to use it * Have acceptable to good pe...)
  • 02:12, 17 April 2010: NFS servers [1,887 bytes], Adr@ugcs.caltech.edu (New page: UGCS has two main NFS servers: [Apollo] for shared files and keytabs, [Demeter] for cfengine. We would like to set up cfengine to automatically configure current (and future) NFS servers ...)
  • 02:17, 11 April 2010: Ugcs groups [2,165 bytes], Adr@ugcs.caltech.edu (New page: We would like to add functionality where users can create custom groups in both LDAP and AFS with the ability to manage them (similar to the mailman list system already in place). ==Premi...)
  • 22:00, 26 March 2010: AFS Servers [1,663 bytes], Jdhutchin@ugcs.caltech.edu (New page: AFS servers are quite important to us because they help take care of most of our important data. ==Overview== AFS has several different types of file servers. They are generically split ...)
  • 00:35, 19 March 2010: Juniper switch [391 bytes], Jdhutchin@ugcs.caltech.edu (New page: The Juniper EX-2200 "mercury" was purchased in March 2010 with our IMSS budget money. It is currently in the process of being set up. Its out-of-band management interfaces is connected t...)
  • 19:37, 10 February 2010: FastCGI [2,809 bytes], Jdhutchin@ugcs.caltech.edu (New page: =Problem Definition= ==Problems to solve== * PHP performance is really bad- it takes 0.3s to load the simple test pages we have * Since php is run as a cgi every time, it cannot do opcode ...)
  • 05:25, 10 February 2010: Nagios Improvements [2,720 bytes], Adr@ugcs.caltech.edu (New page: We should update Nagios to better reflect what the servers are doing: ==Services to watch== * apollo - AFS, rsync * athena - AFS * demeter - DHCP, TFTP *)
  • 05:35, 26 January 2010: New Switch [1,582 bytes], Jdhutchin@ugcs.caltech.edu (New page: We need a new switch badly. We are running out of network ports and don't have very many gige ports. ==Desired Features== * Gigabit support (a must) * 24 or 48 ports * VMPS / Cisco suppo...)
  • 04:00, 26 January 2010: Jan25 Kerberos Incident [1,413 bytes], Jdhutchin@ugcs.caltech.edu (New page: On January 25, many of the shellservers were unable to complete any kerberos operations. The cause was an upgraded kerberos library from testing which did not work well with our existing ...)