

8.5 Stable Release Series 7.2

This is a stable release series of Condor. As usual, only bug fixes (and potentially, ports to new platforms) will be provided in future 7.2.x releases. New features will be added in the 7.3.x development series.

The details of each version are described below.


Version 7.2.4

Release Notes:

New Features:

Configuration Variable Additions and Changes:

Bugs Fixed:

Known Bugs:

Additions and Changes to the Manual:


Version 7.2.3

Release Notes:

New Features:

Configuration Variable Additions and Changes:

Bugs Fixed:

Known Bugs:

Additions and Changes to the Manual:


Version 7.2.2

Release Notes:

New Features:

Configuration Variable Additions and Changes:

Bugs Fixed:

Known Bugs:

Additions and Changes to the Manual:


Version 7.2.1

Release Notes:

New Features:

Configuration Variable Additions and Changes:

Bugs Fixed:

Known Bugs:

  • None.

Additions and Changes to the Manual:

  • Added a FAQ entry for Windows describing how machines with misconfigured performance counters may cause the condor_procd to crash.

  • Added a manual page for the command condor_router_history.


Version 7.2.0

Release Notes:

  • A bug in some older Xen kernels can result in Condor errors due to a broken assumption in the condor_procd daemon. See the FAQ entry at section 7.7 for details.

  • A problem has been discovered when using snapshot disks with vm universe VMware jobs, if the path that the condor_vm-gahp uses to refer to the virtual machine's VMX file contains a symbolic link. See the FAQ entry at section 7.3 for details.

  • The name of the Amazon EC2 GAHP binary has changed from amazon-gahp to amazon_gahp. This makes it consistent with the naming of other Condor binaries.

New Features:

  • The default universe for jobs is now vanilla, instead of standard. The default can be changed using the configuration variable DEFAULT_UNIVERSE .

  • For VMware vm universe jobs, any BIOS settings saved in an nvram file in the vmware_dir given in the job's submit file are now transferred to the execute machine, so that they apply to the job's execution.

  • Daemons that become unresponsive are now killed using the SIGABRT signal, which causes a core file to be dropped. Setting the configuration variable NOT_RESPONDING_WANT_CORE to False will revert to the previous behavior that used the SIGKILL signal.

  • The condor_job_router and the condor_q command with the -better-analyze option now support more ClassAd functions than they previously did. They now support all ClassAd functions, except for those with names beginning with the string stringList.

  • When given the options -submitters -xml, condor_status no longer emits a single blank line when there are no submitters. Instead, it prints valid XML output with an empty body.
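
For scripts that consume the output of condor_status -submitters -xml, the last item above means the output can always be handed to an XML parser. The following Python sketch counts the ads in captured output. It assumes the output has a single root element containing one c element per ClassAd. The function name count_ads and the sample strings EMPTY and ONE are hypothetical, and the samples are simplified rather than copied from real output.

```python
import xml.etree.ElementTree as ET


def count_ads(xml_text):
    """Return the number of ClassAd elements found in XML ClassAd output."""
    text = xml_text.strip()
    if not text:
        # Older versions printed a single blank line, which is not valid XML.
        return 0
    root = ET.fromstring(text)
    # Assumption: each ClassAd is a <c> element directly under the root.
    return len(root.findall("c"))


# Hypothetical sample outputs, simplified for illustration.
EMPTY = '<?xml version="1.0"?>\n<classads>\n</classads>\n'
ONE = (
    '<?xml version="1.0"?>\n<classads>\n'
    '<c><a n="Name"><s>user@example.org</s></a></c>\n'
    '</classads>\n'
)

print(count_ads(EMPTY))
print(count_ads(ONE))
```

The blank-input check is only needed when the same script must also work against versions that still emit the blank line.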

Configuration Variable Additions and Changes:

  • The HAD configuration variable NEGOTIATOR_STATE_FILE has changed its name to STATE_FILE.
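
Existing configuration files that still define the old name need to be updated by hand or by script. The following Python sketch renames definitions of NEGOTIATOR_STATE_FILE to STATE_FILE in configuration text. It assumes settings are written one per line as NAME = value; the function name rename_setting and the sample path are hypothetical. It deliberately leaves macro references such as $(NEGOTIATOR_STATE_FILE) untouched, so those still need a manual review.

```python
import re

OLD_NAME = "NEGOTIATOR_STATE_FILE"
NEW_NAME = "STATE_FILE"

# Match the old name only where it begins a setting, e.g. "NAME = value".
_SETTING = re.compile(r"^(\s*)" + OLD_NAME + r"(\s*=)")


def rename_setting(config_text):
    """Return config_text with definitions of OLD_NAME renamed to NEW_NAME."""
    lines = config_text.splitlines()
    renamed = [
        _SETTING.sub(r"\g<1>" + NEW_NAME + r"\g<2>", line) for line in lines
    ]
    return "\n".join(renamed)


# Hypothetical configuration text; the path is made up for illustration.
sample = (
    "# HAD settings\n"
    "NEGOTIATOR_STATE_FILE = /var/condor/state\n"
    "OTHER = $(NEGOTIATOR_STATE_FILE)\n"
)
print(rename_setting(sample))
```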

Bugs Fixed:

  • Security Item: A flaw was found and fixed that could allow an unauthenticated user to cause Condor daemons to shut down, and could allow running jobs to be removed from the queue.

  • Fixed a bug that caused condor_dagman to stay in the Condor queue, if condor_dagman was accidentally submitted with an empty DAG input file.

  • condor_submit_dag now generates a .condor.sub file with the submit description file command copy_to_spool set to True, to ease version upgrades while large DAGs are running.

  • Fixed a problem in the condor_startd when using STARTD_SLOT_EXPRS for attributes that are sometimes present and sometimes absent from the machine ClassAd. This is most typical of attributes that enter the machine ClassAd from the job, via STARTD_JOB_EXPRS. When the attribute went away from slot X (for example, because the job on slot X finished), the corresponding SlotX_<AttributeName> attribute was not reliably removed from all of the other slots.

  • Removed some redundant information from the condor_startd advertisements to the condor_collector, from within the private ClassAd that is not user-visible. This fix reduces UDP traffic and memory usage generated by the condor_startd by about 20% in the condor_collector and condor_negotiator daemons.

  • Fixed the condor_master daemon to correctly preserve all command-line arguments when restarting itself. In some cases, not preserving argv[0] confused external utilities that monitor the condor_master process by looking at the output of ps or similar programs. Also, not preserving -pid and -runfor could cause unexpected behavior.

  • Fixed a bug that exhibited itself when the configuration variable NEGOTIATOR_CONSIDER_PREEMPTION was set to False, in which jobs would not be matched to slots in the backfill state. Slots doing backfill are now included in the matchmaking process.

  • The condor_job_router did not work while managing jobs from multiple users when read access to the condor_schedd required authentication. The condor_job_router was also not able to use authentication methods other than FS. Now it can use any authentication method, as long as the resulting identity is listed in the configuration variable QUEUE_SUPER_USERS or the condor_job_router and condor_schedd are running as a Personal Condor in non-root mode.

  • Fixed a bug in the condor_schedd daemon that could cause it to write an incorrect Unique ID to the event log's header.

  • Fixed a bug in the user log reader API that could cause it to incorrectly return a ULOG_NO_EVENT in rare cases.

  • Fixed a bug in the user log reader API that could cause it to crash if the application attempted to re-initialize the ReadUserLog object. The code now detects this condition, and returns an error when the application attempts to re-initialize an already initialized ReadUserLog object.

  • Fixed a bug that limited the size of stdin, stdout, and stderr files in the vanilla universe to 2 GBytes.

  • Fixed a bug that could cause the condor_starter to EXCEPT upon completion or eviction of a vm universe job. The error message that appeared in the StarterLog file was
      Write_Pipe: invalid pipe end
    

  • When a held job is removed, the values of the attributes HoldReason, HoldReasonCode and HoldReasonSubCode are moved to LastHoldReason, LastHoldReasonCode and LastHoldReasonSubCode. Before, a hold reason could be lost if a removed job was subsequently held.

  • The executable attribute for amazon grid universe jobs no longer needs to be a valid file path.

  • Improved error reporting when a Xen or VMware command fails in the vm universe.

  • For vm universe jobs, virtual floppy disks are no longer disabled.

  • Fixed a bug introduced in Condor 7.1.4 that caused Condor to ignore the virtual machine status reported by Xen in the vm universe.

  • Fixed a 20-second delay in the start up of the condor_c-gahp and the condor_vm-gahp.

  • Fixed a bug which caused the net mask to be published into the machine ClassAd incorrectly.

  • Fixed a bug introduced in Condor 7.1.4 which could cause any Condor daemon to crash if the level of debugging output D_ALL is enabled when a condor_reconfig command is issued.

  • Fixed a bug introduced in Condor 7.1.4 which caused standard universe jobs to fail to start up, if security authentication, but not encryption, was enabled between the submit side and the execute side.

  • Fixed a bug with streaming stdin, stdout, and stderr when using glexec.
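
The hold-reason item in the list above describes a simple attribute move, which can be modeled on a plain dictionary. The following Python sketch is only an illustration of that behavior, not Condor's implementation. The function name move_hold_attrs and the sample job ad values are hypothetical.

```python
HOLD_ATTRS = ("HoldReason", "HoldReasonCode", "HoldReasonSubCode")


def move_hold_attrs(job_ad):
    """Return a copy of job_ad with each hold attribute moved to Last<name>."""
    updated = dict(job_ad)
    for name in HOLD_ATTRS:
        if name in updated:
            # Moving, not copying: the original attribute is removed.
            updated["Last" + name] = updated.pop(name)
    return updated


# Hypothetical job ad, reduced to the attributes of interest.
held_job = {
    "ClusterId": 1,
    "HoldReason": "held by user",
    "HoldReasonCode": 1,
    "HoldReasonSubCode": 0,
}
print(move_hold_attrs(held_job))
```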

Known Bugs:

  • None.

Additions and Changes to the Manual:

  • Initial documentation for dynamic provisioning is available in section 3.13.9.

  • Documentation for Kerberos authentication (see section 3.6.3) and associated configuration variables has been updated.

condor-admin@cs.wisc.edu