Differences
This shows you the differences between two versions of the page.
condor:installation:configuration [2011/07/14 13:53] – created, configuration files added garrettheath4 | condor:installation:configuration [2011/07/28 19:14] – garrettheath4 | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== | + | ======Condor |
+ | This page contains copies of the configuration files we use on our system. | ||
- | ===== Global Configuration File ===== | + | =====Global Configuration File===== |
- | <file autoconf | + | <file autoconf |
## | ## | ||
## condor_config | ## condor_config | ||
Line 55: | Line 56: | ||
## Pathnames: | ## Pathnames: | ||
## | ## | ||
+ | ## Where are all of the Condor-related files stored for the entire | ||
+ | ## Condor system? | ||
+ | CondorDir = /mnt/config | ||
+ | |||
## Where have you installed the bin, sbin and lib condor directories? | ## Where have you installed the bin, sbin and lib condor directories? | ||
- | RELEASE_DIR = /usr/local/ | + | RELEASE_DIR = |
## Where is the local condor directory for each host? | ## Where is the local condor directory for each host? | ||
## This is where the local config file(s), logs and | ## This is where the local config file(s), logs and | ||
## spool/ | ## spool/ | ||
- | LOCAL_DIR = $(TILDE) | + | LOCAL_DIR = $(CondorDir)/hosts/$(FULL_HOSTNAME) |
- | # | + | |
## Where is the machine-specific local config file for each host? | ## Where is the machine-specific local config file for each host? | ||
- | LOCAL_CONFIG_FILE = /var/lib/condor/condor_config.local | + | #LOCAL_CONFIG_FILE = $(LOCAL_DIR)/condor_config.local |
+ | # If this computer is the Condor Central Manager, load the central | ||
+ | # manager/master config file. Otherwise, load the worker config file. | ||
+ | # | ||
## Where are optional machine-specific local config files located? | ## Where are optional machine-specific local config files located? | ||
## Config files are included in lexicographic order. | ## Config files are included in lexicographic order. | ||
LOCAL_CONFIG_DIR = $(LOCAL_DIR)/ | LOCAL_CONFIG_DIR = $(LOCAL_DIR)/ | ||
- | # | ||
## Blacklist for file processing in the LOCAL_CONFIG_DIR | ## Blacklist for file processing in the LOCAL_CONFIG_DIR | ||
Line 78: | Line 85: | ||
## WARNING: This is a potential security issue. | ## WARNING: This is a potential security issue. | ||
## If not specified, the default is True | ## If not specified, the default is True | ||
- | #REQUIRE_LOCAL_CONFIG_FILE = TRUE | + | REQUIRE_LOCAL_CONFIG_FILE = TRUE |
## | ## | ||
Line 105: | Line 112: | ||
## to specify that each machine has its own file system. | ## to specify that each machine has its own file system. | ||
FILESYSTEM_DOMAIN = cs.wlu.edu | FILESYSTEM_DOMAIN = cs.wlu.edu | ||
+ | |||
+ | ## What machine is your central manager? | ||
+ | CONDOR_HOST = john.cs.wlu.edu | ||
+ | # " | ||
+ | # manager call itself? | ||
+ | CondorHost_RealName = $(CONDOR_HOST) | ||
## This macro is used to specify a short description of your pool. | ## This macro is used to specify a short description of your pool. | ||
Line 131: | Line 144: | ||
## (this can also be specified in the environment) | ## (this can also be specified in the environment) | ||
## Note: the CONDOR_IDS setting is ignored on Win32 platforms | ## Note: the CONDOR_IDS setting is ignored on Win32 platforms | ||
- | # | + | # NOTE: CONDOR_IDS is defined in the machine-specific configuration files |
+ | # | ||
## | ## | ||
Line 142: | Line 156: | ||
## | ## | ||
## To flock to others, define FLOCK_TO. | ## To flock to others, define FLOCK_TO. | ||
+ | |||
+ | ## Join the W&L CS Pool (Orion) | ||
+ | # Designate which machines are members of this pool. | ||
+ | # NOTE: In order to make it easy to automatically change this variable when | ||
+ | # | ||
+ | # value should all be on one line (no line breaks). | ||
+ | PoolMembers = john.cs.wlu.edu, | ||
## FLOCK_FROM defines the machines where you would like to grant | ## FLOCK_FROM defines the machines where you would like to grant | ||
## people access to your pool via flocking. (i.e. you are granting | ## people access to your pool via flocking. (i.e. you are granting | ||
## access to these machines to join your pool). | ## access to these machines to join your pool). | ||
- | FLOCK_FROM = *.cs.wlu.edu | + | FLOCK_FROM = $(PoolMembers) |
## An example of this is: | ## An example of this is: | ||
#FLOCK_FROM = somehost.friendly.domain, | #FLOCK_FROM = somehost.friendly.domain, | ||
Line 154: | Line 175: | ||
## want your jobs to be negotiated at -- thereby specifying the | ## want your jobs to be negotiated at -- thereby specifying the | ||
## pools they will run in.) | ## pools they will run in.) | ||
- | FLOCK_TO = | + | FLOCK_TO = $(CONDOR_HOST) |
## An example of this is: | ## An example of this is: | ||
#FLOCK_TO = central_manager.friendly.domain, | #FLOCK_TO = central_manager.friendly.domain, | ||
Line 176: | Line 197: | ||
## Please see the administrator' | ## Please see the administrator' | ||
## settings, what they' | ## settings, what they' | ||
+ | |||
+ | # Define fully-qualified usernames for users' right definitions | ||
+ | CondorUsers = condor@$(UID_DOMAIN)/ | ||
+ | condor@$(UID_DOMAIN)/ | ||
+ | condor@$(UID_DOMAIN)/ | ||
+ | AdminUsers = koller@$(UID_DOMAIN)/ | ||
+ | RootUsers = root@$(UID_DOMAIN)/ | ||
+ | root@$(UID_DOMAIN)/ | ||
+ | root@$(UID_DOMAIN)/ | ||
## What machines have administrative rights for your pool? This | ## What machines have administrative rights for your pool? This | ||
Line 182: | Line 212: | ||
## (assuming you trust all the users who log into that/those | ## (assuming you trust all the users who log into that/those | ||
## machine(s), since this is machine-wide access you're granting). | ## machine(s), since this is machine-wide access you're granting). | ||
- | ALLOW_ADMINISTRATOR = $(CONDOR_HOST) | + | ALLOW_ADMINISTRATOR = $(AdminUsers) |
## If there are no machines that should have administrative access | ## If there are no machines that should have administrative access | ||
Line 208: | Line 238: | ||
## you install, configure or debug your Condor installation. | ## you install, configure or debug your Condor installation. | ||
## It is important to have this defined. | ## It is important to have this defined. | ||
- | ALLOW_READ = *.cs.wlu.edu | + | ALLOW_READ = $(AdminUsers), |
+ | | ||
#ALLOW_READ = *.your.domain, | #ALLOW_READ = *.your.domain, | ||
#DENY_READ = *.bad.subnet, | #DENY_READ = *.bad.subnet, | ||
Line 224: | Line 255: | ||
## machines to your pool and is a serious security risk. | ## machines to your pool and is a serious security risk. | ||
- | ALLOW_WRITE = $(FULL_HOSTNAME), $(IP_ADDRESS) | + | ALLOW_WRITE = $(AdminUsers), $(CondorUsers), \ |
+ | *@cs.wlu.edu/ | ||
# | # | ||
#DENY_WRITE = bad-machine.your.domain | #DENY_WRITE = bad-machine.your.domain | ||
Line 241: | Line 273: | ||
## Negotiator access. | ## Negotiator access. | ||
## managers. | ## managers. | ||
+ | # | ||
ALLOW_NEGOTIATOR = $(CONDOR_HOST) | ALLOW_NEGOTIATOR = $(CONDOR_HOST) | ||
## Now, with flocking we need to let the SCHEDD trust the other | ## Now, with flocking we need to let the SCHEDD trust the other | ||
Line 251: | Line 284: | ||
## access should only be granted with extreme caution. | ## access should only be granted with extreme caution. | ||
## config access is denied from all hosts. | ## config access is denied from all hosts. | ||
- | #ALLOW_CONFIG = trusted-host.your.domain | + | ALLOW_CONFIG = $(AdminUsers) |
+ | |||
+ | ## Daemon Access added by Garrett Koller (not in default config file) | ||
+ | ## Daemon access. | ||
+ | ## with the daemons of " | ||
+ | ## machines will be acknowledged and appropriate responses will be sent. | ||
+ | # | ||
+ | ALLOW_DAEMON = $(PoolMembers) | ||
+ | |||
+ | ## Client Access added by Garrett Koller (not in default config file) | ||
+ | ## Client access. | ||
+ | ## I allow or deny." | ||
+ | ALLOW_CLIENT = $(PoolMembers) | ||
## Flocking Configs. | ## Flocking Configs. | ||
Line 260: | Line 305: | ||
ALLOW_READ_COLLECTOR | ALLOW_READ_COLLECTOR | ||
ALLOW_READ_STARTD | ALLOW_READ_STARTD | ||
+ | |||
+ | # Clear out any old-style HOSTALLOW settings: | ||
+ | HOSTALLOW_READ = | ||
+ | HOSTALLOW_WRITE = | ||
+ | HOSTALLOW_DAEMON = | ||
+ | HOSTALLOW_NEGOTIATOR = | ||
+ | HOSTALLOW_ADMINISTRATOR = | ||
+ | HOSTALLOW_OWNER = | ||
+ | |||
+ | ## | ||
+ | ## Authentication | ||
+ | ## | ||
+ | ## Authentication added by Garrett Koller (not in default config file) | ||
+ | ## These parameters define how Condor will know whether or not a | ||
+ | ## machine that attempts to communicate with it is who it says it is. | ||
+ | ## Refer to Section 3.6.3 " | ||
+ | ## documentation for more information | ||
+ | |||
+ | # A client process (run by a normal user on a machine that may or | ||
+ | # may not have Condor installed, such as condor_submit) or another | ||
+ | # Condor daemon (either running locally or remotely) will offer these | ||
+ | # authentication methods when trying to communicate with the Condor | ||
+ | # system daemons. | ||
+ | SEC_CLIENT_AUTHENTICATION = PREFERRED | ||
+ | SEC_CLIENT_AUTHENTICATION_METHODS = PASSWORD, FS, FS_REMOTE | ||
+ | SEC_CLIENT_INTEGRITY = OPTIONAL | ||
+ | |||
+ | # A daemon will accept these forms of authentication when | ||
+ | # communicating | ||
+ | SEC_DEFAULT_AUTHENTICATION = PREFERRED | ||
+ | SEC_DEFAULT_AUTHENTICATION_METHODS = PASSWORD, FS, FS_REMOTE | ||
+ | SEC_DEFAULT_INTEGRITY = OPTIONAL | ||
+ | |||
+ | # Password authentication | ||
+ | # Note: TILDE refers to the ' | ||
+ | SEC_PASSWORD_FILE = $(TILDE)/ | ||
+ | FS_REMOTE_DIR = $(CondorDir)/ | ||
+ | |||
+ | SEC_ADMINISTRATOR_AUTHENTICATION = REQUIRED | ||
+ | SEC_CONFIG_AUTHENTICATION = REQUIRED | ||
+ | SEC_READ_AUTHENTICATION = OPTIONAL | ||
Line 277: | Line 363: | ||
## uncomment the following setting and change the value to " | ## uncomment the following setting and change the value to " | ||
## Note: changing this requires a restart not just a reconfig. | ## Note: changing this requires a restart not just a reconfig. | ||
- | #ENABLE_RUNTIME_CONFIG = False | + | ENABLE_RUNTIME_CONFIG = True |
## Do you want to allow condor_config_val -set to work at all? | ## Do you want to allow condor_config_val -set to work at all? | ||
Line 283: | Line 369: | ||
## uncomment the following setting and change the value to " | ## uncomment the following setting and change the value to " | ||
## Note: changing this requires a restart not just a reconfig. | ## Note: changing this requires a restart not just a reconfig. | ||
- | #ENABLE_PERSISTENT_CONFIG = False | + | ENABLE_PERSISTENT_CONFIG = False |
## Directory where daemons should write persistent config files (used | ## Directory where daemons should write persistent config files (used | ||
Line 297: | Line 383: | ||
## prior to version 6.3.3. | ## prior to version 6.3.3. | ||
## should leave this commented out. | ## should leave this commented out. | ||
- | #SETTABLE_ATTRS_CONFIG = * | + | SETTABLE_ATTRS_CONFIG = * |
## Attributes that can be set by hosts with " | ## Attributes that can be set by hosts with " | ||
Line 326: | Line 412: | ||
## Do you want to use NFS for file access instead of remote system | ## Do you want to use NFS for file access instead of remote system | ||
## calls? | ## calls? | ||
- | #USE_NFS = | + | USE_NFS = |
## Do you want to use AFS for file access instead of remote system | ## Do you want to use AFS for file access instead of remote system | ||
Line 339: | Line 425: | ||
## False, checkpoints will be written to the local SPOOL directory on | ## False, checkpoints will be written to the local SPOOL directory on | ||
## the submission machine. | ## the submission machine. | ||
- | #USE_CKPT_SERVER = | + | USE_CKPT_SERVER = |
## What's the hostname of this machine' | ## What's the hostname of this machine' | ||
Line 369: | Line 455: | ||
## Assume we can use up to 80% of memory and estimate shadow private data | ## Assume we can use up to 80% of memory and estimate shadow private data | ||
## size of 800k. | ## size of 800k. | ||
- | #MAX_SHADOWS_MEM = ceiling($(DETECTED_MEMORY)*0.8*1024/ | + | MAX_SHADOWS_MEM = ceiling($(DETECTED_MEMORY)*0.8*1024/ |
## Assume we can use ~21,000 ephemeral ports (avg ~2.1 per shadow). | ## Assume we can use ~21,000 ephemeral ports (avg ~2.1 per shadow). | ||
## Under Linux, the range is set in / | ## Under Linux, the range is set in / | ||
- | #MAX_SHADOWS_PORTS = 10000 | + | MAX_SHADOWS_PORTS = 10000 |
## Under windows, things are much less scalable, currently. | ## Under windows, things are much less scalable, currently. | ||
## Note that this can probably be safely increased a bit under 64-bit windows. | ## Note that this can probably be safely increased a bit under 64-bit windows. | ||
- | #MAX_SHADOWS_OPSYS = ifThenElse(regexp(" | + | MAX_SHADOWS_OPSYS = ifThenElse(regexp(" |
## Now build up the expression for MAX_JOBS_RUNNING. | ## Now build up the expression for MAX_JOBS_RUNNING. | ||
## due to lack of a min() function. | ## due to lack of a min() function. | ||
- | #MAX_JOBS_RUNNING = $(MAX_SHADOWS_MEM) | + | MAX_JOBS_RUNNING = $(MAX_SHADOWS_MEM) |
- | #MAX_JOBS_RUNNING = \ | + | MAX_JOBS_RUNNING = \ |
- | # | + | ifThenElse( $(MAX_SHADOWS_PORTS) < $(MAX_JOBS_RUNNING), |
- | # | + | $(MAX_SHADOWS_PORTS), |
- | # | + | $(MAX_JOBS_RUNNING) ) |
- | #MAX_JOBS_RUNNING = \ | + | MAX_JOBS_RUNNING = \ |
- | # | + | ifThenElse( $(MAX_SHADOWS_OPSYS) < $(MAX_JOBS_RUNNING), |
- | # | + | $(MAX_SHADOWS_OPSYS), |
- | # | + | $(MAX_JOBS_RUNNING) ) |
Line 411: | Line 497: | ||
## you've set in the CONDOR_IDS environment variable. | ## you've set in the CONDOR_IDS environment variable. | ||
## manual for details on this. | ## manual for details on this. | ||
- | LOCK = | + | LOCK = |
## If you don't use a fully qualified name in your /etc/hosts file | ## If you don't use a fully qualified name in your /etc/hosts file | ||
Line 429: | Line 515: | ||
## your top-level config file for this mode of operation to work | ## your top-level config file for this mode of operation to work | ||
## properly. | ## properly. | ||
- | #NO_DNS = True | + | NO_DNS = False |
## Condor can be told whether or not you want the Condor daemons to | ## Condor can be told whether or not you want the Condor daemons to | ||
Line 477: | Line 563: | ||
## default for this setting is False, since it is more secure this | ## default for this setting is False, since it is more secure this | ||
## way. | ## way. | ||
- | #TRUST_UID_DOMAIN = False | + | TRUST_UID_DOMAIN = False |
## If you would like to be informed in near real-time via condor_q when | ## If you would like to be informed in near real-time via condor_q when | ||
Line 496: | Line 582: | ||
## it being defined to False. Set it to True to allow the shadow to execute | ## it being defined to False. Set it to True to allow the shadow to execute | ||
## arbitrary shell code from the user job. | ## arbitrary shell code from the user job. | ||
- | #SHADOW_ALLOW_UNSAFE_REMOTE_EXEC = False | + | SHADOW_ALLOW_UNSAFE_REMOTE_EXEC = False |
## KEEP_OUTPUT_SANDBOX is an optional feature to tell Condor-G to not | ## KEEP_OUTPUT_SANDBOX is an optional feature to tell Condor-G to not | ||
Line 546: | Line 632: | ||
## The allowed values are case insensitive. | ## The allowed values are case insensitive. | ||
## The default of this parameter if not specified is: stm_use_schedd_only | ## The default of this parameter if not specified is: stm_use_schedd_only | ||
- | #SANDBOX_TRANSFER_METHOD = stm_use_schedd_only | + | SANDBOX_TRANSFER_METHOD = stm_use_schedd_only |
## This setting specifies an IP address that depends on the setting of | ## This setting specifies an IP address that depends on the setting of | ||
Line 557: | Line 643: | ||
## interface if one is available. If it cannot decide which of two interfaces | ## interface if one is available. If it cannot decide which of two interfaces | ||
## to choose from, it will pick the first one. | ## to choose from, it will pick the first one. | ||
- | # | + | # |
+ | # | ||
## | ## | ||
Line 567: | Line 654: | ||
## | ## | ||
- | ALL_DEBUG | + | #TODO |
+ | ALL_DEBUG | ||
MAX_COLLECTOR_LOG = 1000000 | MAX_COLLECTOR_LOG = 1000000 | ||
Line 593: | Line 681: | ||
MASTER_DEBUG = | MASTER_DEBUG = | ||
## When the master starts up, should it truncate its log file? | ## When the master starts up, should it truncate its log file? | ||
- | #TRUNC_MASTER_LOG_ON_OPEN | + | TRUNC_MASTER_LOG_ON_OPEN |
MAX_JOB_ROUTER_LOG | MAX_JOB_ROUTER_LOG | ||
Line 621: | Line 709: | ||
## instance of a daemon stopped running. This parameter controls how often | ## instance of a daemon stopped running. This parameter controls how often | ||
## the daemons touch the file (in seconds). | ## the daemons touch the file (in seconds). | ||
- | #TOUCH_LOG_INTERVAL = 60 | + | TOUCH_LOG_INTERVAL = 300 |
###################################################################### | ###################################################################### | ||
Line 679: | Line 767: | ||
JustCPU = ($(CPUBusy) && ($(KeyboardBusy) == False)) | JustCPU = ($(CPUBusy) && ($(KeyboardBusy) == False)) | ||
MachineBusy = ($(CPUBusy) || $(KeyboardBusy)) | MachineBusy = ($(CPUBusy) || $(KeyboardBusy)) | ||
+ | |||
+ | ## If job submitter user is listed here, give the job a high priority. | ||
+ | GreedyUserRank = (Owner == " | ||
+ | + (Owner == " | ||
+ | + (Owner == " | ||
+ | + (Owner == " | ||
+ | + (Owner == " | ||
+ | + (Owner == " | ||
+ | |||
+ | IsGreedyUser | ||
## The RANK expression controls which jobs this machine prefers to | ## The RANK expression controls which jobs this machine prefers to | ||
Line 687: | Line 785: | ||
## By default, RANK is always 0, meaning that all jobs have an equal | ## By default, RANK is always 0, meaning that all jobs have an equal | ||
## ranking. | ## ranking. | ||
- | #RANK = | + | RANK = |
Line 705: | Line 803: | ||
# When should we only consider SUSPEND instead of PREEMPT? | # When should we only consider SUSPEND instead of PREEMPT? | ||
- | WANT_SUSPEND = $(UWCS_WANT_SUSPEND) | + | #WANT_SUSPEND = $(UWCS_WANT_SUSPEND) |
+ | WANT_SUSPEND = ( $(SmallJob) || $(KeyboardNotBusy) \ | ||
+ | || $(IsVanilla) ) \ | ||
+ | && ( $(SUSPEND) | ||
# When should we preempt gracefully instead of hard-killing? | # When should we preempt gracefully instead of hard-killing? | ||
- | WANT_VACATE = $(UWCS_WANT_VACATE) | + | # |
+ | WANT_VACATE = | ||
+ | || $(IsVanilla) | ||
## When is this machine willing to start a job? | ## When is this machine willing to start a job? | ||
- | START = $(UWCS_START) | + | #Note: START is redefined in the local config file of ' |
+ | #START = | ||
+ | START = ( $(CPUIdle) || \ | ||
+ | (State != " | ||
## When should a local universe job be allowed to start? | ## When should a local universe job be allowed to start? | ||
Line 720: | Line 826: | ||
## When to suspend a job? | ## When to suspend a job? | ||
- | SUSPEND = $(UWCS_SUSPEND) | + | #SUSPEND = |
+ | SUSPEND = ( (CpuBusyTime > 2 * $(MINUTE)) \ | ||
+ | && $(ActivationTimer) > 90 ) | ||
## When to resume a suspended job? | ## When to resume a suspended job? | ||
- | CONTINUE = $(UWCS_CONTINUE) | + | #CONTINUE = |
+ | CONTINUE = ( $(CPUIdle) && ($(ActivityTimer) > 5) ) | ||
## When to nicely stop a job? | ## When to nicely stop a job? | ||
## (as opposed to killing it instantaneously) | ## (as opposed to killing it instantaneously) | ||
- | PREEMPT = $(UWCS_PREEMPT) | + | #PREEMPT = |
+ | PREEMPT = ( ((Activity == " | ||
+ | ($(ActivityTimer) > $(MaxSuspendTime))) \ | ||
+ | || (SUSPEND && (WANT_SUSPEND == False)) | ||
## When to instantaneously kill a preempting job | ## When to instantaneously kill a preempting job | ||
Line 931: | Line 1043: | ||
LOG = $(LOCAL_DIR)/ | LOG = $(LOCAL_DIR)/ | ||
SPOOL = $(LOCAL_DIR)/ | SPOOL = $(LOCAL_DIR)/ | ||
- | EXECUTE = $(LOCAL_DIR)/ | + | #EXECUTE = $(LOCAL_DIR)/execute |
+ | # Note: TILDE refers to the ' | ||
+ | EXECUTE = $(TILDE)/execute | ||
BIN = $(RELEASE_DIR)/ | BIN = $(RELEASE_DIR)/ | ||
LIB = $(RELEASE_DIR)/ | LIB = $(RELEASE_DIR)/ | ||
Line 971: | Line 1085: | ||
## This attribute only takes effect on restart of the daemons or at the next | ## This attribute only takes effect on restart of the daemons or at the next | ||
## update time. | ## update time. | ||
- | # LOCK_FILE_UPDATE_INTERVAL = 28800 | + | LOCK_FILE_UPDATE_INTERVAL = 28800 |
## This setting primarily allows you to change the port that the | ## This setting primarily allows you to change the port that the | ||
Line 989: | Line 1103: | ||
## How long are you willing to let daemons try their graceful | ## How long are you willing to let daemons try their graceful | ||
## shutdown methods before they do a hard shutdown? (30 minutes) | ## shutdown methods before they do a hard shutdown? (30 minutes) | ||
- | #SHUTDOWN_GRACEFUL_TIMEOUT = 1800 | + | SHUTDOWN_GRACEFUL_TIMEOUT = 1800 |
## How much disk space would you like reserved from Condor? | ## How much disk space would you like reserved from Condor? | ||
Line 995: | Line 1109: | ||
## partitions, it subtracts the amount it really finds by this | ## partitions, it subtracts the amount it really finds by this | ||
## many megabytes. | ## many megabytes. | ||
- | RESERVED_DISK = | + | RESERVED_DISK = |
## If your machine is running AFS and the AFS cache lives on the same | ## If your machine is running AFS and the AFS cache lives on the same | ||
Line 1017: | Line 1131: | ||
## set to your domain, not $(FULL_HOSTNAME), | ## set to your domain, not $(FULL_HOSTNAME), | ||
## user@UID_DOMAIN won't work. | ## user@UID_DOMAIN won't work. | ||
- | #EMAIL_DOMAIN = $(FULL_HOSTNAME) | + | EMAIL_DOMAIN = mail.wlu.edu |
## Should Condor daemons create a UDP command socket (for incoming | ## Should Condor daemons create a UDP command socket (for incoming | ||
Line 1056: | Line 1170: | ||
## different port ranges for incoming and outgoing connections by | ## different port ranges for incoming and outgoing connections by | ||
## using IN_HIGHPORT/ | ## using IN_HIGHPORT/ | ||
- | #HIGHPORT = 9700 | + | HIGHPORT = 9700 |
- | #LOWPORT = 9600 | + | LOWPORT = 9600 |
## If a daemon doesn' | ## If a daemon doesn' | ||
Line 1073: | Line 1187: | ||
## | ## | ||
## Daemons you want the master to keep running for you: | ## Daemons you want the master to keep running for you: | ||
- | DAEMON_LIST = MASTER, STARTD, | + | # NOTE: DAEMON_LIST is defined in the local configuration files |
+ | #DAEMON_LIST = MASTER, STARTD, | ||
## Which daemons use the Condor DaemonCore library (i.e., not the | ## Which daemons use the Condor DaemonCore library (i.e., not the | ||
Line 1118: | Line 1233: | ||
## If a daemon dies an unnatural death, do you want email about it? | ## If a daemon dies an unnatural death, do you want email about it? | ||
- | #PUBLISH_OBITUARIES = True | + | PUBLISH_OBITUARIES = True |
## If you're getting obituaries, how many lines of the end of that | ## If you're getting obituaries, how many lines of the end of that | ||
## daemon' | ## daemon' | ||
- | #OBITUARY_LOG_LENGTH = | + | OBITUARY_LOG_LENGTH = |
## Should the master run? | ## Should the master run? | ||
- | #START_MASTER = True | + | START_MASTER = True |
## Should the master start up the daemons you want it to? | ## Should the master start up the daemons you want it to? | ||
- | #START_DAEMONS = True | + | START_DAEMONS = True |
## How often do you want the master to send an update to the central | ## How often do you want the master to send an update to the central | ||
## manager? | ## manager? | ||
- | #MASTER_UPDATE_INTERVAL = 300 | + | MASTER_UPDATE_INTERVAL = 300 |
## How often do you want the master to check the timestamps of the | ## How often do you want the master to check the timestamps of the | ||
## daemons it's running? | ## daemons it's running? | ||
## master restarts them. | ## master restarts them. | ||
- | #MASTER_CHECK_NEW_EXEC_INTERVAL = | + | MASTER_CHECK_NEW_EXEC_INTERVAL = |
## Once you notice new binaries, how long should you wait before you | ## Once you notice new binaries, how long should you wait before you | ||
## try to execute them? | ## try to execute them? | ||
- | #MASTER_NEW_BINARY_DELAY = 120 | + | MASTER_NEW_BINARY_DELAY = 120 |
## What's the maximum amount of time you're willing to give the | ## What's the maximum amount of time you're willing to give the | ||
## daemons to quickly shutdown before you just kill them outright? | ## daemons to quickly shutdown before you just kill them outright? | ||
- | #SHUTDOWN_FAST_TIMEOUT = 120 | + | SHUTDOWN_FAST_TIMEOUT = 120 |
###### | ###### | ||
Line 1154: | Line 1269: | ||
## the exponent used to determine how long to wait before starting | ## the exponent used to determine how long to wait before starting | ||
## the daemon again: | ## the daemon again: | ||
- | #MASTER_BACKOFF_FACTOR = 2.0 | + | MASTER_BACKOFF_FACTOR = 2.0 |
## What's the maximum amount of time you want the master to wait | ## What's the maximum amount of time you want the master to wait | ||
## between attempts to start a given daemon? | ## between attempts to start a given daemon? | ||
## MASTER_BACKOFF_FACTOR, | ## MASTER_BACKOFF_FACTOR, | ||
- | #MASTER_BACKOFF_CEILING = 3600 | + | MASTER_BACKOFF_CEILING = 3600 |
## How long should a daemon run without crashing before we consider | ## How long should a daemon run without crashing before we consider | ||
## it " | ## it " | ||
## of restarts so the exponential backoff stuff goes back to normal. | ## of restarts so the exponential backoff stuff goes back to normal. | ||
- | #MASTER_RECOVER_FACTOR = 300 | + | MASTER_RECOVER_FACTOR = 300 |
Line 1172: | Line 1287: | ||
## Address to which Condor will send a weekly e-mail with output of | ## Address to which Condor will send a weekly e-mail with output of | ||
## condor_status. | ## condor_status. | ||
+ | # NOTE: CONDOR_DEVELOPERS is defined in the local configuration files | ||
# | # | ||
Line 1184: | Line 1300: | ||
## Determine if the Negotiator will honor SlotWeight attributes, which | ## Determine if the Negotiator will honor SlotWeight attributes, which | ||
## may be used to give a slot greater weight when calculating usage. | ## may be used to give a slot greater weight when calculating usage. | ||
- | #NEGOTIATOR_USE_SLOT_WEIGHTS = True | + | NEGOTIATOR_USE_SLOT_WEIGHTS = True |
## How often the Negotiator starts a negotiation cycle, defined in | ## How often the Negotiator starts a negotiation cycle, defined in | ||
## seconds. | ## seconds. | ||
- | #NEGOTIATOR_INTERVAL = 60 | + | NEGOTIATOR_INTERVAL = 60 |
## Should the Negotiator publish an update to the Collector after | ## Should the Negotiator publish an update to the Collector after | ||
## every negotiation cycle. It is useful to have this set to True | ## every negotiation cycle. It is useful to have this set to True | ||
## to get immediate updates on LastNegotiationCycle statistics. | ## to get immediate updates on LastNegotiationCycle statistics. | ||
- | #NEGOTIATOR_UPDATE_AFTER_CYCLE = False | + | NEGOTIATOR_UPDATE_AFTER_CYCLE = False |
Line 1214: | Line 1330: | ||
## When a machine is claimed, how often should we poll the state of | ## When a machine is claimed, how often should we poll the state of | ||
## the machine to see if we need to evict/ | ## the machine to see if we need to evict/ | ||
- | #POLLING_INTERVAL | + | POLLING_INTERVAL |
## How often should the startd send updates to the central manager? | ## How often should the startd send updates to the central manager? | ||
- | #UPDATE_INTERVAL | + | UPDATE_INTERVAL |
## How long is the startd willing to stay in the " | ## How long is the startd willing to stay in the " | ||
- | #MATCH_TIMEOUT = | + | MATCH_TIMEOUT = |
## How long is the startd willing to stay in the preempting/ | ## How long is the startd willing to stay in the preempting/ | ||
## state before it just kills the starter directly? | ## state before it just kills the starter directly? | ||
- | #KILLING_TIMEOUT = | + | KILLING_TIMEOUT = |
## When a machine is unclaimed, when should it run benchmarks? | ## When a machine is unclaimed, when should it run benchmarks? | ||
Line 1244: | Line 1360: | ||
## What's the max " | ## What's the max " | ||
## (1.01), the startd will run the benchmarks serially. | ## (1.01), the startd will run the benchmarks serially. | ||
- | benchmarks_max_job_load = 1.0 | + | benchmarks_max_job_load = 1.01 |
# MIPS (Dhrystone 2.1) benchmark: load 1.0 | # MIPS (Dhrystone 2.1) benchmark: load 1.0 | ||
Line 1271: | Line 1387: | ||
## names of devices you want considered the console, without the | ## names of devices you want considered the console, without the | ||
## "/ | ## "/ | ||
- | CONSOLE_DEVICES = mouse, | + | CONSOLE_DEVICES = console |
Line 1432: | Line 1548: | ||
## be " | ## be " | ||
## as console activity). | ## as console activity). | ||
- | #SLOTS_CONNECTED_TO_KEYBOARD = 1 | + | SLOTS_CONNECTED_TO_KEYBOARD = 1 |
## If there are slots that aren't connected to the | ## If there are slots that aren't connected to the | ||
Line 1447: | Line 1563: | ||
## but you still want jobs to start right away on some of your SMP | ## but you still want jobs to start right away on some of your SMP | ||
## nodes, just increase this parameter. | ## nodes, just increase this parameter. | ||
- | #DISCONNECTED_KEYBOARD_IDLE_BOOST = 1200 | + | DISCONNECTED_KEYBOARD_IDLE_BOOST = 1200 |
###### | ###### | ||
Line 1509: | Line 1625: | ||
## How often should the schedd send an update to the central manager? | ## How often should the schedd send an update to the central manager? | ||
- | #SCHEDD_INTERVAL = 300 | + | SCHEDD_INTERVAL = 300 |
## How long should the schedd wait between spawning each shadow? | ## How long should the schedd wait between spawning each shadow? | ||
- | #JOB_START_DELAY = 2 | + | JOB_START_DELAY = 2 |
## How many concurrent sub-processes should the schedd spawn to handle | ## How many concurrent sub-processes should the schedd spawn to handle | ||
## queries? | ## queries? | ||
- | #SCHEDD_QUERY_WORKERS | + | SCHEDD_QUERY_WORKERS |
## How often should the schedd send a keep alive message to any | ## How often should the schedd send a keep alive message to any | ||
## startds it has claimed? | ## startds it has claimed? | ||
- | #ALIVE_INTERVAL = 300 | + | ALIVE_INTERVAL = 300 |
## This setting controls the maximum number of times that a | ## This setting controls the maximum number of times that a | ||
Line 1526: | Line 1642: | ||
## the condor_schedd will simply relinquish the match associated with | ## the condor_schedd will simply relinquish the match associated with | ||
## the dying shadow. | ## the dying shadow. | ||
- | #MAX_SHADOW_EXCEPTIONS = 5 | + | MAX_SHADOW_EXCEPTIONS = 5 |
## Estimated virtual memory size of each condor_shadow process. | ## Estimated virtual memory size of each condor_shadow process. | ||
Line 1535: | Line 1651: | ||
## submit machines. | ## submit machines. | ||
## The higher the number, the lower priority the shadows have. | ## The higher the number, the lower priority the shadows have. | ||
- | # SHADOW_RENICE_INCREMENT = | + | SHADOW_RENICE_INCREMENT = |
## The condor_schedd can renice scheduler universe processes | ## The condor_schedd can renice scheduler universe processes | ||
Line 1557: | Line 1673: | ||
## but it will ensure that all jobs that can be started will be | ## but it will ensure that all jobs that can be started will be | ||
## started. | ## started. | ||
- | #NEGOTIATE_ALL_JOBS_IN_CLUSTER = False | + | NEGOTIATE_ALL_JOBS_IN_CLUSTER = True |
## This setting controls how often, in seconds, the schedd considers | ## This setting controls how often, in seconds, the schedd considers | ||
Line 1580: | Line 1696: | ||
## queue? | ## queue? | ||
## By default, this only includes root. | ## By default, this only includes root. | ||
- | QUEUE_SUPER_USERS = root, condor | + | # |
+ | # NOTE: QUEUE_SUPER_USERS is defined in the local configuration files | ||
+ | #QUEUE_SUPER_USERS = root, condor | ||
Line 1591: | Line 1709: | ||
## the job is started from scratch. | ## the job is started from scratch. | ||
## setting is only used if USE_CKPT_SERVER (from above) is True. | ## setting is only used if USE_CKPT_SERVER (from above) is True. | ||
- | #MAX_DISCARDED_RUN_TIME = 3600 | + | MAX_DISCARDED_RUN_TIME = 3600 |
## Should periodic checkpoints be compressed? | ## Should periodic checkpoints be compressed? | ||
- | #COMPRESS_PERIODIC_CKPT = False | + | COMPRESS_PERIODIC_CKPT = False |
## Should vacate checkpoints be compressed? | ## Should vacate checkpoints be compressed? | ||
- | #COMPRESS_VACATE_CKPT = False | + | COMPRESS_VACATE_CKPT = False |
## Should we commit the application' | ## Should we commit the application' | ||
Line 1610: | Line 1728: | ||
## How often should the shadow update the job queue with job | ## How often should the shadow update the job queue with job | ||
## attributes that periodically change? | ## attributes that periodically change? | ||
- | #SHADOW_QUEUE_UPDATE_INTERVAL = 15 * 60 | + | SHADOW_QUEUE_UPDATE_INTERVAL = 10 * 60 |
## Should the shadow wait to update certain job attributes for the | ## Should the shadow wait to update certain job attributes for the | ||
Line 1616: | Line 1734: | ||
## attributes as they change? | ## attributes as they change? | ||
## aggressive updates to a busy condor_schedd, | ## aggressive updates to a busy condor_schedd, | ||
- | #SHADOW_LAZY_QUEUE_UPDATE = TRUE | + | SHADOW_LAZY_QUEUE_UPDATE = TRUE |
Line 1630: | Line 1748: | ||
## zero (i.e. the job must be reniced) or the mechanism that | ## zero (i.e. the job must be reniced) or the mechanism that | ||
## monitors CPU load on Win32 systems will give erratic results. | ## monitors CPU load on Win32 systems will give erratic results. | ||
- | #JOB_RENICE_INCREMENT = | + | JOB_RENICE_INCREMENT = |
## Should the starter do local logging to its own log file, or send | ## Should the starter do local logging to its own log file, or send | ||
## debug information back to the condor_shadow where it will end up | ## debug information back to the condor_shadow where it will end up | ||
## in the ShadowLog? | ## in the ShadowLog? | ||
- | #STARTER_LOCAL_LOGGING = | + | STARTER_LOCAL_LOGGING = |
## If the UID_DOMAIN settings match on both the execute and submit | ## If the UID_DOMAIN settings match on both the execute and submit | ||
Line 1642: | Line 1760: | ||
## exit with an error. | ## exit with an error. | ||
## job with the specified UID, even if it's not in the passwd file? | ## job with the specified UID, even if it's not in the passwd file? | ||
- | #SOFT_UID_DOMAIN = FALSE | + | SOFT_UID_DOMAIN = FALSE |
## honor the run_as_owner option from the condor submit file. | ## honor the run_as_owner option from the condor submit file. | ||
## | ## | ||
- | #STARTER_ALLOW_RUNAS_OWNER = TRUE | + | STARTER_ALLOW_RUNAS_OWNER = TRUE |
## Tell the Starter/ | ## Tell the Starter/ | ||
Line 1703: | Line 1821: | ||
## site, uncomment these entries. | ## site, uncomment these entries. | ||
# | # | ||
- | # | + | # |
## If you want expressions only appended for either standard or | ## If you want expressions only appended for either standard or | ||
Line 1716: | Line 1834: | ||
## This can be used to define a default value for the rank expression | ## This can be used to define a default value for the rank expression | ||
## if one is not specified in the submit file. | ## if one is not specified in the submit file. | ||
- | #DEFAULT_RANK | + | DEFAULT_RANK |
## If you want universe-specific defaults, you can use the following | ## If you want universe-specific defaults, you can use the following | ||
Line 1748: | Line 1866: | ||
## | ## | ||
## Who should condor_preen send email to? | ## Who should condor_preen send email to? | ||
- | #PREEN_ADMIN = $(CONDOR_ADMIN) | + | PREEN_ADMIN = $(CONDOR_ADMIN) |
## What files should condor_preen leave in the spool directory? | ## What files should condor_preen leave in the spool directory? | ||
Line 1769: | Line 1887: | ||
JAVA = / | JAVA = / | ||
+ | JAVA_MAXHEAP_ARGUMENT = -Xmx1024m | ||
## JAVA_CLASSPATH_DEFAULT gives the default set of paths in which | ## JAVA_CLASSPATH_DEFAULT gives the default set of paths in which | ||
Line 2031: | Line 2150: | ||
## Index file path of saved credentials. | ## Index file path of saved credentials. | ||
## This file will be automatically created if it does not exist. | ## This file will be automatically created if it does not exist. | ||
- | #CRED_INDEX_FILE = $(CRED_STORE_DIR/ | + | CRED_INDEX_FILE = $(CRED_STORE_DIR)/cred-index |
## condor_credd | ## condor_credd | ||
Line 2120: | Line 2239: | ||
# If this is set to true, then the rest of the QUILL arguments must be defined | # If this is set to true, then the rest of the QUILL arguments must be defined | ||
- | # for quill to function. If it is Fase or left undefined, then quill will not | + | # for quill to function. If it is False or left undefined, then quill will not |
# be consulted by either the scheduler or the tools, but in the case of a | # be consulted by either the scheduler or the tools, but in the case of a | ||
# remote quill query where the local client has quill turned off, but the | # remote quill query where the local client has quill turned off, but the | ||
# remote client has quill turned on, things will still function normally. | # remote client has quill turned on, things will still function normally. | ||
- | #QUILL_ENABLED = TRUE | + | QUILL_ENABLED = FALSE |
# | # | ||
Line 2523: | Line 2642: | ||
## name node (see HDFS_SERVICES) then the specified port will be used | ## name node (see HDFS_SERVICES) then the specified port will be used | ||
## to run name node. | ## to run name node. | ||
- | HDFS_NAMENODE = hdfs:// | + | #HDFS_NAMENODE = hdfs:// |
- | HDFS_NAMENODE_WEB = example.com: | + | #HDFS_NAMENODE_WEB = example.com: |
- | HDFS_BACKUPNODE = hdfs:// | + | #HDFS_BACKUPNODE = hdfs:// |
- | HDFS_BACKUPNODE_WEB = example.com: | + | #HDFS_BACKUPNODE_WEB = example.com: |
## You need to pick one machine as name node by setting this parameter | ## You need to pick one machine as name node by setting this parameter | ||
Line 2585: | Line 2704: | ||
- | ===== Condor Master Local Configuration File ===== | + | =====Central Manager Shared |
- | <file autoconf | + | <file autoconf |
- | CONDOR_HOST = john.cs.wlu.edu | + | # the machine' |
- | + | ||
- | ## Where is the local condor directory for each host? | + | |
- | ## | + | |
- | ## spool/ | + | |
- | LOCAL_DIR = / | + | |
- | + | ||
- | ## Mail parameters: | + | |
- | ## When something goes wrong with condor at your site, who should get | + | |
- | ## | + | |
- | CONDOR_ADMIN = kollerg14@mail.wlu.edu | + | |
- | + | ||
- | ## Full path to a mail delivery program that understands that " | + | |
- | ## means you want to specify a subject: | + | |
- | MAIL = / | + | |
- | + | ||
- | ## Network domain parameters: | + | |
- | ## Internet domain of machines sharing a common UID space. | + | |
- | ## machines don't share a common UID space, set it to | + | |
- | ## UID_DOMAIN = $(FULL_HOSTNAME) | + | |
- | ## to specify that each machine | + | |
- | UID_DOMAIN = cs.wlu.edu | + | |
- | + | ||
- | ## Internet domain of machines sharing a common file system. | + | |
- | ## If your machines don't use a network file system, set it to | + | |
- | ## FILESYSTEM_DOMAIN = $(FULL_HOSTNAME) | + | |
- | ## to specify that each machine has its own file system. | + | |
- | FILESYSTEM_DOMAIN = cs.wlu.edu | + | |
- | + | ||
- | ## The user/group ID < | + | |
- | ## (this can also be specified | + | |
- | ## Note: the CONDOR_IDS setting is ignored on Win32 platforms | + | |
- | CONDOR_IDS = 201.481 | + | |
- | + | ||
- | ## | + | |
- | ## various log files. | + | |
- | ## filesystems and file locking over the years, we HIGHLY recommend | + | |
- | ## that you put these lock files on a local partition on each | + | |
- | ## machine. | + | |
- | ## be sure to change this entry. | + | |
- | ## running as needs to have write access to this directory. | + | |
- | ## you're not running as root, this is whatever user you started up | + | |
- | ## the condor_master as. If you are running as root, and there' | + | |
- | ## condor account, it's probably condor. | + | |
- | ## you've set in the CONDOR_IDS environment variable. | + | |
- | ## manual for details on this. | + | |
- | LOCK = / | + | |
DAEMON_LIST = COLLECTOR, MASTER, NEGOTIATOR, SCHEDD, STARTD, KBDD | DAEMON_LIST = COLLECTOR, MASTER, NEGOTIATOR, SCHEDD, STARTD, KBDD | ||
- | ## Java parameters: | + | ##-------------------------------------------------------------------- |
- | ## | + | ## |
- | ## then set JAVA to the path of your JVM binary. | + | ##-------------------------------------------------------------------- |
- | ## | + | ## Address to which Condor will send a weekly e-mail with output of |
- | ## empty or incorrect. | + | ## condor_status. |
- | JAVA = / | + | CONDOR_DEVELOPERS |
- | JAVA_MAXHEAP_ARGUMENT | + | |
- | + | ||
- | # Designate which machines are members of this pool. | + | |
- | PoolMembers = carl.cs.wlu.edu, fred.cs.wlu.edu | + | |
- | # Allow machines to check the status of Condor | + | |
- | ALLOW_READ = $(ALLOW_READ), | + | |
- | # Allow machines to join this pool | + | |
- | ALLOW_WRITE = $(ALLOW_WRITE), | + | |
- | FLOCK_FROM = $(PoolMembers) | + | |
- | + | ||
- | # Enable debugging of Class Ads | + | |
- | LeaseManager.DEBUG_ADS = True</ | + | |
- | + | ||
- | ===== Worker Local Configuration File ===== | + | |
- | <file autoconf condor_config.local> | + | |
- | CONDOR_DEVELOPERS = NONE | + | |
- | CONDOR_HOST = $(PoolMaster) | + | |
- | COLLECTOR_NAME = Orion | + | |
- | + | ||
- | # If job submitter user is listed here, start the job regardless of | + | |
- | # who might be using the computer at the time. | + | |
- | IsGreedyUser = (Owner == " | + | |
- | || Owner == " | + | |
- | || Owner == " | + | |
- | || Owner == " | + | |
- | || Owner == " | + | |
- | || Owner == " | + | |
- | START = ( ( (KeyboardIdle > $(StartIdleTime)) \ | + | |
- | && ( $(CPUIdle) || \ | + | |
- | | + | |
- | || $(IsGreedyUser) ) | + | |
- | SUSPEND = FALSE | + | |
- | PREEMPT = FALSE | + | |
- | KILL = FALSE | + | |
+ | =====Worker Shared Configuration File===== | ||
+ | <file autoconf condor_config_worker.shared># | ||
+ | # the machine' | ||
DAEMON_LIST = MASTER, STARTD, KBDD | DAEMON_LIST = MASTER, STARTD, KBDD | ||
- | NEGOTIATOR_INTERVAL = 20 | ||
- | TRUST_UID_DOMAIN = TRUE | ||
- | # Join the W&L CS Pool (Orion) | + | ## |
- | FLOCK_TO = john.cs.wlu.edu | + | ## condor_collector |
- | ALLOW_WRITE = $(ALLOW_WRITE), | + | ## |
- | + | ## Address to which Condor will send a weekly e-mail with output of |
- | # Enable debugging | + | ## condor_status. |
- | LeaseManager.DEBUG_ADS | + | # Don't send weekly statistics emails. |
+ | # The central manager will do that. | ||
+ | CONDOR_DEVELOPERS |