1 Introduction
For more than a decade, the STAR Collaboration has been searching for evidence of chiral magnetic effects (CME) [1-3]. CME [4, 5] refers to the induction of an electric current (
A set of observables common to CME searches are the charge-separation fluctuations perpendicular to ΨRP, e.g. with a three-point correlator [6], γ (ϕα + ϕβ -2ΨRP), where averaging is done over all particles in an event and over all events. To draw firm conclusions on the presence of CME, an effective way is needed to disentangle the signal and background contributions, the latter of which are intertwined with collective flow. Collisions of isobaric nuclei, e.g.
In 2018, STAR collected data from isobar collisions,
2 Blinding Techniques
2.1 General principle
Blind analyses often rely on a “reference sample” and an inability to differentiate two or more samples or a particular sample from the reference (see Ref. [9] for a brief overview of blind analyses in particle physics). The reference sample is often used either to tune an analysis without pre-determined bias or to provide a reference for evaluating the significance of a result, e.g. eliminating placebo effects or genetic conditions that may bias the result of medical studies.
2.2 Considerations
While many possibilities exist, the blinding method for a particular analysis should be well-matched to the specific needs of that analysis. For example, many of the typical methods (see Ref. [9] and references therein) do not specifically address the situation of comparing parallel analyses of two different but quite similar data samples. Among the specific considerations for analysis of the 2018 STAR isobar data are the following:
• The un-blind data should not be accessible by physics analysts prior to analysis tuning.
• Accounting for time-dependent detector fluctuations is a critical component of analysis quality assurance (Q/A).
• Accounting for run-by-run anomalies is a critical component of final analysis Q/A.
• Methods to blind by “hiding” or “offsetting” variables or information needed to gain sensitivity to signals are quite common in the literature, e.g. Refs. [10] and [11]. We found many of these methods not well-suited to our analysis. In many cases, randomizing variables within an event may severely compromise the quality of analysis Q/A and associated corrections. For example, randomizing the sign of charged particle tracks would prevent charge-dependent efficiency corrections; and randomizing particle azimuthal angle would destroy correlations from secondary decays. Because of these considerations, such methods are not retained as part of this procedure.
• To ensure the isobar species have statistically comparable behaviors in terms of luminosity, event trigger composition, energy, vertex distribution, occupancy of tracks, etc., the 2018 RHIC run involved frequent switching of the isobar collision species.
• With this consideration in mind, it is feasible to interleave or “mix” events from the two collision species in a given output data file as an efficient method to disguise the collision species.
• Certain STAR experts, recused from blind physics analyses, may require isobar information during RHIC running to ensure data of sufficient quality to achieve target physics goals.
• Calibration experts, who are recused from conducting blind physics analyses, may need access to un-blind data to ensure sufficiently robust calibrations and corrections to achieve the desired physics goals.
• Runs of quality suitable for inclusion in physics analyses, e.g. not exhibiting large detector inefficiencies, must proceed prior to the mixing of events from different species.
For the blind analysis of isobar data collected in 2018, STAR adopted a three-step blinding procedure. For the first step, analysts are provided output data files that mix events from the two isobar collision species, while respecting the time-dependence of run conditions. Analysts use this data sample to perform time-dependent Q/A of the data and to tune analysis codes. At the conclusion of these studies, analysts commit their code to a repository. In the second step, analysts are provided an “unmixed-blind,” sample suitable for calculating corrections that vary according to individual ≈30-minute data-taking runs. The run identification numbers are disguised, but the output data files do not mix events from different runs. Only these “run-by-run” corrections (e.g. for changing detector efficiencies) and code alterations subsequent to these corrections are allowed during this step. At the conclusion of these studies, the final codes are committed to the repository, so that differences may be evaluated. After the analysis codes are verified, the final data analysis pass is completed using these final codes and the fully un-blind data released.
2.3 Initial procedure
Initial implementation of the analysis blinding procedure began prior to and during the 2018 RHIC run. To the extent possible, information pertaining to the isobar species was restricted during the run. Access to raw data for purposes of Q/A during the run was restricted to identified experts, approximately 5% of the collaboration, recused from blind physics analyses. To the extent possible, all raw data samples were limited in size below the level needed for sensitivity to a CME signal, e.g. less than 10000 events. Un-blind experts produced species-blind performance plots to evaluate data quality for the run in-progress.
Prior to the software production of the blind data, it was necessary to set detector calibrations and determine an appropriate list of quality data-taking runs. Due to the importance of robust calibrations to the physics analyses, these calibrations were performed by the relevant experts using un-blind data. These calibration experts were recused from participation in blinded physics analyses. Additionally, a committee was designated to determine data-taking runs of sufficient quality for inclusion in physics analyses. Members of this run selection committee were also recused from participation in blinded physics analyses. Production of the blind data commenced after calibrations and the designation of good runs.
No physics analysis groups are provided with un-blinded data prior to completion of the un-blinding procedure.
2.4 Blind data production
In the blind production of data, the following information encoded in the data stream (DST) are obfuscated: the identification numbers for the event, its particular data-taking run, and RHIC fill; the event timestamp; the event collision species; and the hit rates for the east and west STAR zero-degree calorimeters (ZDC) [12] and beam-beam counters (BBC) [13], as well as their coincidence and background rates. All output data files are assigned a generic name and pseudo-run-number that monotonically increases with time. The exact start time of a data production is not known to ensure, e.g. that a particular pseudo-run-number is not trivially related to a particular isobar species. The mixing procedure and exact algorithm to re-assign pseudo-run numbers are encrypted and only known by two experts, who are recused from performing blind physics analyses. The reference sample, species-separated samples, and fully unblind samples are provided in a three-step process.
2.5 Step-1: “The Reference”
Analysts are initially provided output files composed of events from a mix of the two isobar species. The mixing procedure is not a priori known. As much as possible, the order of events respects temporal changes in running conditions. Events showing peculiar discrepancies from the initial Q/A are excluded from the sample, and events from the two species are only combined if the detector performance, e.g. acceptance, was similar for the two events. Events are randomly rejected at the level of ∼10%, so that the species cannot be determined, e.g. by counting the number of events associated with a particular run or event trigger and correlating it with information from the run log database. Analysis code and time-dependent Q/A are tuned on this reference sample, committed to the analysis code repository, and kept unchanged at this stage. Among other aspects, this step enables extraction of time-dependent spectra for Q/A, detection of time-dependent anomalies, detection of secondary decays and measurement of peak widths relevant to momentum resolution.
2.6 Step-2: “The run by run Q/A sample”
After analysis of the reference data, analysts are provided an “unmixed-blind sample” comprised of files that obscure the true run number (and, hence, the isobar species) but do not mix events across different runs. The pseudo-run-number uniquely maps to one true run number and one (unknown) isobar species. The data are provided in such a way that a mix of files from each species appear in the same directory. As in the first step, a fraction of events from each run is rejected to ensure that simple counting of events could not decipher the species. This sample enables species-blind run-by-run Q/A. Only run-by-run corrections and code alteration directly resulting from these corrections are allowed at this stage. The number of events provided per file is tuned so that statistics are sufficient for robust corrections but insufficient for deciphering the isobar species.
2.7 Step-3: Full Un-blinding
Once Q/A is complete and analyses of the run-by-run Q/A data are final, full un-blinding proceeds. At this stage, physics results are produced with the previously tuned, vetted, and fixed analysis codes. In this data production, all information is un-blinded and restored to the data files.
3 Implementation and Timeline for Blinded Analyses
No STAR physics analyses had access to species information prior to un-blinding. The timelines for un-blinding are estimated by the blind analysts, who present regular updates to their respective physics working groups (PWG) to document progress and to inform adjustments to the timeline. Decisions to un-blind are based upon a review of thoroughly documented analysis procedures, codes, and analysis reports–including estimates of measurement uncertainty–by the relevant PWG. In addition, for blind analyses of the isobar data, so-called “godparent committees” or “GPCs,” are set early and follow analyses closely throughout their development. The GPCs serve an important role in verifying that analyses are ready to proceed to the next stages of the blinding procedure. After the step-1 data are available, blind-data analysts estimate a timeline for completing the necessary analyses for advancing to step-2. Based on this input from the analysts, management approves a date for the beginning of the second step. Analysts present regular updates to document progress. Regardless of progress, un-blinding occurs no earlier than the original estimate unless all blind analyses are deemed ready to proceed by STAR Management. Based upon the progress reports, un-blinding may be delayed to ensure the quality of the final results. An analogous timeline procedure is done for the full un-blinding. Prior to the first un-blinding step, analysts prepare detailed notes documenting the procedures, cuts, corrections, systematic uncertainties, and criteria for any future run-by-run cuts and corrections. Prior to the second un-blinding step, analysts ensure that the documentation is updated and complete, including the run-by-run portion of analyses. Prior to each un-blinding step, analysts provide analysis codes for vetting and Q/A by the GPC in addition to the standard vetting within the physics working groups.
When the GPC is satisfied that an analysis is ready for un-blinding, analysts present the status of their analyses to the physics working group conveners and the physics analysis coordinator. As the un-blinding date approaches, analysts discuss with STAR management any need for delays to un-blinding to ensure the quality of results. If an unresolved disagreement exists between analysts, the decision to un-blind or extend the date lies with STAR management. After physics results are produced with un-blinded data, a review is conducted to verify that the frozen analysis code was used to produce the results.
While un-blinded data are not accessible to physics analyses until the blinding timeline is completed, management uses discretion in applying blinding to any calibration analysis. To ensure the integrity of calibrations, e.g. those of the beamline and TPC [14], STAR calibration experts may require access to un-blind data. Without robust calibrations, the physics analyses may not be able to achieve the required precision for deciphering a CME signal. Therefore, the relevant experts are allowed access to the un-blind data for these tasks. Furthermore, access to un-blind data is restricted to these experts alone and the experts recuse themselves from participation in any blind physics analysis.
4 Mock data challenge
As the recommended analysis blinding procedure represents a substantial departure from that typical for STAR analyses, testing feasibility is critical. Toward this end, a “mock data challenge” was conducted utilizing data from Au+Au collisions at
-202105/1001-8042-32-05-005/alternativeImage/1001-8042-32-05-005-F001.jpg)
5 After Un-blinding
After un-blinding, only changes to correct “mistakes,” defined for this purpose as errors in arithmetic or unintended departures from the approved and documented analysis procedures, are allowed. If such a correction is made, the analysis results with the error will also be provided with a detailed explanation of the specific correction applied and why it was needed. On a case-by-case basis, the collaboration considers announcing the result from a blind analysis simultaneously with the submission of the corresponding paper to the journal and the preprint arXiv. Regardless, only one set of “final” results from the blind analysis will be released, e.g. there will be no set of “preliminary” results prior to the “final” results. All STAR publications of 2018 results state explicitly whether the analysis followed the approved STAR blinding procedure.
6 Conclusion
The STAR Collaboration has developed a procedure to carry out blind analyses of isobar collision data, collected in 2018. The procedure described in this manuscript was accepted by the STAR Council in January 2018, prior to the isobar collision runs. The initial step in the procedure is an analysis of blinded data samples that interleave events from the two collision species, while the second step involves analysis of blinded data samples that do not mix events from the two collision species, followed by complete un-blinding of the data. Prior to commencing with analysis of the isobar data, a mock data challenge was successfully conducted to demonstrate the feasibility of the procedure both from an analysis standpoint and a computational standpoint. Analyses of the blind data are underway, following the procedure outlined in this manuscript.
Azimuthal Charged-Particle Correlations and Possible Local Strong Parity Violation
. Phys. Rev. Lett. 103, 251601 (2009). arXiv:0909.1739, doi: 10.1103/PhysRevLett.103.251601Observation of charge-dependent azimuthal correlations and possible local strong parity violation in heavy ion collisions
. Phys. Rev. 81, 054908 (2010). arXiv:0909.1717, doi: 10.1103/PhysRevC.81.054908Search for Chiral Magnetic Effects in High-Energy Nuclear Collisions
. Nucl. Phys. 904-905, 248c-255c (2013). arXiv:1210.5498, doi: 10.1016/j.nuclphysa.2013.01.069Parity violation in hot QCD: Why it can happen, and how to look for it
. Phys. Lett. 633, 260-264 (2006). arXiv:hep-ph/0406125, doi: 10.1016/j.physletb.2005.11.075The Effects of topological charge change in heavy ion collisions: ‘Event by event P and CP violation’
. Nucl. Phys. 803, 227-253 (2008). arXiv:0711.0950, doi: 10.1016/j.nuclphysa.2008.02.298Parity violation in hot QCD: How to detect it
. Phys. Rev. 70, 057901 (2004). arXiv:hep-ph/0406311, doi: 10.1103/PhysRevC.70.057901Testing the Chiral Magnetic Effect with Central U+U collisions
. Phys. Rev. Lett. 105, 172301 (2010). arXiv:1006.1020, doi: 10.1103/PhysRevLett.105.172301Test the chiral magnetic effect with isobaric collisions
. Phys. Rev. 94, 041901 (2016). arXiv:1607.04697, doi: 10.1103/PhysRevC.94.041901Blind analysis in nuclear and particle physics
. Ann. Rev. Nucl. Part. Sci. 55, 141-163 (2005). doi: 10.1146/annurev.nucl.55.090704.151521Observation of direct CP violation in KS,L→ππ decays
. Phys. Rev. Lett. 83, 22-27 (1999). arXiv:hep-ex/9905060, doi: 10.1103/PhysRevLett.83.22Measurement of CP violating asymmetries in B0 decays to CP eigenstates
. Phys. Rev. Lett. 86, 2515-2522 (2001). arXiv:hep-ex/0102030, doi: 10.1103/PhysRevLett.86.2515The STAR trigger
. Nucl. Instrum. Meth. 499, 766-777 (2003). doi: 10.1016/S0168-9002(02)01974-5Relative luminosity measurement in STAR and implications for spin asymmetry determinations
. AIP Conf. Proc. 675, 424-428 (2003). doi: 10.1063/1.1607171The Star time projection chamber: A Unique tool for studying high multiplicity events at RHIC
. Nucl. Instrum. Meth. 499, 659-678 (2003). arXiv:nucl-ex/0301015, doi: 10.1016/S0168-9002(02)01964-2