New pages

From UGCS
Jump to: navigation, search
New pages
Hide bots | Hide redirects
(Latest | Earliest) View (newer 50 | ) (20 | 50 | 100 | 250 | 500)
  • 14:39, 25 September 2012Hackathon Points (hist)[1,282 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (Created page with "These are things that should be done around UGCS and their appropriate point value. They are split in two dimensions: * Requires sysadmin / does not require sysadmin * Multiple ...")
  • 21:08, 14 February 2012Feb14 2012 AFS problems (hist)[1,108 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (Created page with "On the morning of Feb 14, 2012, UGCS had a large AFS outage. ==Timeline== All times in PST. * 7:45: alexr notices AFS problems, reboots apollo, athena. * 8:20: jdhutchin starts...")
  • 06:41, 12 September 2011Alerting (hist)[2,584 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (Created page with "We have a variety of automated alerting at UGCS to let us know when things are breaking or already broken. ==Notes on alerts== Some alerts are critical, so it is nice if they go...")
  • 20:38, 14 November 2010Nov 20 maintenance (hist)[516 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: ==Todo== * Reboot coreservers * Reboot shellservers * Reboot kabta/enlil * Reset jdhutchin's password on enlil * Check proper failover with hera/zeus * Check that AFS still functions witho...)
  • 06:26, 14 October 2010Remote System Management (hist)[1,402 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Remote System Management (rsm) is a set of remctl scripts that let a remote user run some common administrative tasks on a remote machine. It is primarily designed to let automated system...)
  • 03:44, 18 August 2010AFS Standards (hist)[897 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: These standards will help AFS failover work well and ensure our stability even if we lose an AFS server. * Every volume that has an RO replica must have one on the same host as the RW vol...)
  • 01:22, 27 July 2010Smokeping (hist)[1,659 bytes]Adr@ugcs.caltech.edu (Talk | contribs) (New page: Smokeping is a network latency monitor powered on RRDTool. It collects data about network latency for various services and reports it through a convenient, graph-powered website. ==Confi...)
  • 11:26, 11 July 2010XMPP (hist)[2,730 bytes]Adr@ugcs.caltech.edu (Talk | contribs) (New page: We want XMPP, so now we have ejabberd running in alpha on Hephaestus. Service will be up and down as various configurations are tried. *Username: <ugcs username> *Domain: hephaestus....)
  • 01:52, 10 July 2010Backup tapes (hist)[432 bytes]Raymondj@ugcs.caltech.edu (Talk | contribs) (adding serial number data on tapes)
  • 00:35, 10 July 20102009-2010 Budget (hist)[903 bytes]Adr@ugcs.caltech.edu (Talk | contribs) (New page: At beginning of summer 2010 we have spent $3500 on the new juniper switch and various network cables. We have $1500 left to spend, and everything needs to be purchased by July 31. Any id...)
  • 09:28, 19 June 2010PGP keyserver (hist)[354 bytes]Adr@ugcs.caltech.edu (Talk | contribs) (New page: We have a public SKS OpenPGP keyserver. http://pgp.ugcs.caltech.edu It is configured through cfengine, relevant files in /afs/ugcs/ugcs-admin/cfengine/hosts/sks/, /afs/ugcs/ugcs-admi...)
  • 07:44, 6 June 2010HTTP Servers (hist)[564 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: ==Goals== The goals behind our web setup are to: * Maintain high uptime for static pages * Maintain high database uptime for dynamic pages that need to use it * Have acceptable to good pe...)
  • 02:12, 17 April 2010NFS servers (hist)[1,887 bytes]Adr@ugcs.caltech.edu (Talk | contribs) (New page: UGCS has two main NFS servers: [Apollo] for shared files and keytabs, [Demeter] for cfengine. We would like to set up cfengine to automatically configure current (and future) NFS servers ...)
  • 02:17, 11 April 2010Ugcs groups (hist)[2,165 bytes]Adr@ugcs.caltech.edu (Talk | contribs) (New page: We would like to add functionality where users can create custom groups in both LDAP and AFS with the ability to manage them (similar to the mailman list system already in place). ==Premi...)
  • 22:00, 26 March 2010AFS Servers (hist)[1,663 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: AFS servers are quite important to us because they help take care of most of our important data. ==Overview== AFS has several different types of file servers. They are generically split ...)
  • 00:35, 19 March 2010Juniper switch (hist)[391 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: The Juniper EX-2200 "mercury" was purchased in March 2010 with our IMSS budget money. It is currently in the process of being set up. Its out-of-band management interfaces is connected t...)
  • 19:37, 10 February 2010FastCGI (hist)[2,809 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: =Problem Definition= ==Problems to solve== * PHP performance is really bad- it takes 0.3s to load the simple test pages we have * Since php is run as a cgi every time, it cannot do opcode ...)
  • 05:25, 10 February 2010Nagios Improvements (hist)[2,720 bytes]Adr@ugcs.caltech.edu (Talk | contribs) (New page: We should update Nagios to better reflect what the servers are doing: ==Services to watch== * apollo - AFS, rsync * athena - AFS * demeter - DHCP, TFTP *)
  • 05:35, 26 January 2010New Switch (hist)[1,582 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: We need a new switch badly. We are running out of network ports and don't have very many gige ports. ==Desired Features== * Gigabit support (a must) * 24 or 48 ports * VMPS / Cisco suppo...)
  • 04:00, 26 January 2010Jan25 Kerberos Incident (hist)[1,413 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: On January 25, many of the shellservers were unable to complete any kerberos operations. The cause was an upgraded kerberos library from testing which did not work well with our existing ...)
  • 04:37, 12 January 2010Cisco switches (hist)[253 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: We currently have 3 cisco switches to handle our network traffic. ==Access== You can access them through SSH from charon. ==Names== * Gigabit switch 192.168.1.2 : strangelove * Other two...)
  • 19:43, 28 November 2009Mail Improvements (hist)[1,472 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: It'd be nice if we fixed some issues with our mail system and added more features. ==Backup Server== If hermes dies, we are screwed. We also don't have any testing machines to test any o...)
  • 04:42, 7 October 2009Office Hours (hist)[322 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: =Office Hours= __NOTOC__ In order to better conenct our users, UGCS will try to hold regular office hours. Current office hours times are [b]7:30pm-9pm[/b] in [[Documentation:Lab|our lab...)
  • 07:35, 28 September 2009Postgres (hist)[647 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: We run Postgresql 8.3 on poseidon. By default, users do not get a database, but they can have one created automatically (remctl poseidon postgres createdb). Since the minimum database si...)
  • 06:27, 9 September 2009UGCS Best Practices (hist)[3,260 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: This page aims to document all the things we typically do so that new admins can get up to speed faster. ==Machine Setup== * Use LVM for all machines. If you don't, you're a moron. * Lea...)
  • 07:28, 25 August 2009IPMI (hist)[3,059 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: IPMI is a tool used for remote management of servers. It lets us do many tasks without physically going to the server to mess with it. =Basics= =Specifics= ==Poweredge 860's== ==Athena...)
  • 07:27, 25 August 2009How to get environment information (hist)[1,371 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: The first part of debugging a problem is figuring out what's going on. This page aims to document the myraid ways of figuring out what's going on, and how to expose non-obvious problems. ...)
  • 06:58, 19 August 2009Shellserver Packages (hist)[1,410 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Since we no longer have a shared root, you have to be a little more careful in making changes to clio. =Installing packages= To install a package, first install it on clio. Then, run /...)
  • 06:19, 18 August 2009Installing packages with cfengine (hist)[1,789 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Installing packages with cfengine can be very powerful and a huge time-saver, but it needs to be set up correctly to work. =Overview= CFengine has built-in package manager support. It ta...)
  • 01:54, 15 August 2009Enlil (hist)[418 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Enlil is a sysadmin-only machine. It is to provide us access to our IPMI stuff when all else fails. =Hardware= * 1U Einux machine at the top of the left rack =Network= * Name: enlil.cal...)
  • 02:46, 12 July 2009Initramfs (hist)[398 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Initramfs are archives that the kernel uses to get to its root file system. ==Scripts== You can add your custom scripts in /etc/initramfs-tools/scripts/<dir>. Note that their names mus...)
  • 10:38, 11 July 2009Shellserver Systemimager (hist)[5,525 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: I think we should start working on moving our shellservers to an automated, imaged setup instead of our current read-only root over NFS. There are a couple of reasons: ==Pros== * Read-on...)
  • 02:05, 11 July 2009Virtualized Servers (hist)[521 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: It would be really awesome if we let our users create their own virtualized servers. ==Resources== ==Technology== We would use [http://xen.org Xen] to run the virtualization. Automated ...)
  • 19:17, 4 July 2009Getting Started with UGCS (hist)[2,666 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: UGCS is a medium-sized system with a lot to learn about. This guide attempts to present a logical order for learning how to administer UGCS. =Core infrastructure= ==AFS== AFS is one of t...)
  • 18:33, 9 June 2009New System Setup (hist)[3,559 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: While we don't set up a new machine very often, sometimes it needs to be done. ==Run the Debian Installer== Just use the debian installer for the current release that we are using. See [...)
  • 07:42, 27 May 2009K5start (hist)[393 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: K5start is a command that is used to help non-kerberized services play nicely with kerberos and AFS. It supports getting tokens (using a keytab file), running aklog, and then running the ...)
  • 09:26, 26 May 2009Email Heartbeat (hist)[273 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: This script sends a email through a mail system, and makes sure that it comes out the other end within a reasonable amount of time. If it doesn't, it can send alerts. See hermes:/usr/loc...)
  • 09:08, 26 May 2009Donut (hist)[321 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Donut is ASCIT's fileserver. We have access to it and "maintain" it for them. =Hardware= * Dell desktop =Services= * Apache and postgres for donut * Postfix for mail (with the same spam...)
  • 09:07, 26 May 2009Charon (hist)[258 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Charon is our network router =Hardware= * 1U PowerEdge 860 * Purchased new for UGCS 4.0 (Summer 2007) * Dual-core Xeon 1.86GHz * 2Gb ram * 2x 80gb hard drives * Located in the upper-left ...)
  • 09:01, 26 May 2009Alerts (hist)[2,236 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Splunk runs a bunch of saved searches that can activate alerts. Log in to splunk and go to "admin" and then "saved searches". Splunk saved search scripts are located at charon:/opt/s...)
  • 08:56, 26 May 2009Dionysus (hist)[1,421 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Dionysus is one of UGCS's coreservers. It is intended to be a secondary server for "internal applications" =Hardware= * 2U Dell PowerEdge 2450 * 1 1.4Ghz PIII * 2Gb ram =Network= * IP 1...)
  • 22:09, 23 May 2009Certificate Authority (hist)[1,788 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: We run our own certificate authority for all sorts of reasons. The CA certificate is available at http://ca.ugcs.caltech.edu Category:Sysadmin_Documentation)
  • 19:47, 23 May 2009Nagios (hist)[3,784 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: We currently use [http://www.nagios.org Nagios] as a service monitor. It periodically checks nearly every service in UGCS, and makes sure that the service is at least responsive. It is c...)
  • 07:15, 11 May 2009Man Pages (hist)[483 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: If you make a utility for UGCS, you should make a corresponding man page. Google for tutorials, this is just some helpful info. ==Previewing== Preview with man -l <filename> [[Category...)
  • 19:00, 9 May 2009Splunk Saved Searches (hist)[20 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: If you find a problem that is easily identified through a splunk search, please make a search for it even if you think you've solved the problem. The saved search and alert can let us kno...)
  • 08:17, 6 May 2009Automated Password Reset (hist)[2,008 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: The automated password reset program allows users to semi-automatically reset passwords. It is a series of three scripts: * pwreset_shell.py This is a shell program for the login accoun...)
  • 00:16, 6 May 2009Coreserver Cron (hist)[327 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: Cron stuff should be put in /etc/cron.(hourly,daily,weekly,monthly), if possible. Otherwise, you can add a file to /etc/cron.d (look at files already there for the syntax). Please don't ...)
  • 03:17, 3 May 2009Building Packages (hist)[457 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: You probably want to use dpkg-buildpackage to build a package. Because we have to build two architectures and don't have cross-compiling enabled, you have to build each architecture separ...)
  • 07:33, 2 May 2009SVN (hist)[2,805 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (Copied from the public page)
  • 07:26, 2 May 2009How the website works (hist)[1,426 bytes]Jdhutchin@ugcs.caltech.edu (Talk | contribs) (New page: The UGCS webpage is pulled from the Website: namespace of this wiki. A script on poseidon (/usr/local/sbin/wiki-pull) converts the wiki pages into web pages. The data goes into /afs/.ugc...)
(Latest | Earliest) View (newer 50 | ) (20 | 50 | 100 | 250 | 500)
Views
Personal tools
Toolbox