Searching for Fast Radio Transients with SKA Phase 1
T. M. Colegate A B and N. Clarke AA International Centre for Radio Astronomy Research, Curtin University, GPO Box U1987, Perth, WA 6845, Australia
B Corresponding author. Email: tim.colegate@icrar.org
Publications of the Astronomical Society of Australia 28(4) 299-316 https://doi.org/10.1071/AS11031
Submitted: 23 June 2011 Accepted: 29 August 2011 Published: 4 November 2011
Journal Compilation © Astronomical Society of Australia JYEAR
Abstract
The Square Kilometre Array (SKA) provides an excellent opportunity for low-cost searches for fast radio transients. The increased sensitivity and field of view of the SKA compared with other radio telescopes will make it an ideal instrument to search for impulsive emission from high–energy density events. We present a high-level search ‘use case’ and propose event rate per unit cost as a figure of merit to compare transient survey strategies for radio telescope arrays; we use event rate per beam formed and searched as a first-order approximation of this measure. Key results are that incoherent (phase-insensitive) combination of antenna signals achieves the highest event rate per beam, and that 50–100 MHz processed bandwidth is sufficient for extragalactic searches with SKA Phase 1; the gain in event rate from using the full available bandwidth is small. Greater system flexibility will enable more effective searches, but need not drive the top-level system requirements beyond those already proposed for the SKA. The most appropriate search strategy depends on the observed sky direction and the source population; for SKA Phase 1, low-frequency aperture arrays tend to be more effective for extragalactic searches, and dishes more effective for directions of increased scatter broadening, such as near the Galactic plane.
Keywords: surveys — telescopes — methods: observational — techniques: interferometric
1 Introduction
The high time resolution (HTR) universe has been relatively poorly observed at radio wavelengths and opening up this parameter space is a key science driver of the Square Kilometre Array (SKA), via ‘exploration of the unknown’ (Wilkinson et al. 2004). We define fast transients as impulsive, singly occurring or intermittent signals, emitted from high energy density events; a search for such events assumes an observed pulse width less than the normal correlator averaging time of a few seconds. In this context, pulsars can be classed as periodic fast transients. Other known transients include giant pulses, magnetars and rotating radio transients (Cordes 2009; Macquart et al. 2010b).
Interest in exploring HTR parameter space is growing; Table 1 lists some searches with existing and future telescopes. The SKA will provide at least an order of magnitude improvement in sensitivity over all these telescopes and a larger field of view (FoV) than most, as shown in Figure 1 of Macquart et al. (2010b). Both FoV and sensitivity contribute to the expected rate of event detection; employing the analyses in Cordes, Lazio, & McLaughlin (2004) and Macquart (2011), this yields at least one or two orders of magnitude improvement in event rate for the SKA. Table 1 compares Phase 1 of the SKA (SKA1) to other experiments, using the calculations developed in this paper.
Different search strategies are needed for fast transients and pulsars, despite some common HTR requirements. Pulsar surveys proposed for the SKA (e.g. Smits et al. 2009; Cordes 2009) take advantage of pulsar periodicity to improve sensitivity. They involve a computationally expensive, systematic survey of the Galaxy, where the observer ‘drives’ the telescope. The figure of merit (FoM) typically used to determine the effectiveness of such a survey measures the speed at which an area of sky is surveyed to a certain sensitivity (Cordes 2009). However, this FoM does not consider the number of events detectable in a volume of sky, nor does it give any weight to the processing cost of sampling the sky or searching the data.
The main goal of at least first-generation fast transients searches is to maximise the number of events detected in a survey. The expected rate of event detection depends on, among other factors, the search strategy employed on the telescope. Given each search strategy has a different processing cost, event rate per unit cost is a more comprehensive FoM than survey speed. Put simply, one search strategy may have a higher total rate of detection than another, but the processing cost of the strategy also needs to be considered. This is especially important for SKA1, considering transients detection will only be carried out ‘if it can be done with minimum additional cost or effort’ (Dewdney et al. 2010).
Because signal and search processing costs are architecture-specific, we use a new FoM, event rate per beam formed and searched to generalise the problem and parametrise the effectiveness of a search strategy. It is based on the rate of transient events detectable in a volume of sky as discussed in Macquart (2011), although it could similarly be applied to a FoM for surveying sky area. It assumes , which is valid when cost increases linearly with the number of beams (independent FoVs) formed and searched. This is true for first-order beamforming and data transport costs for the SKA (Chippendale, Colegate, & O’Sullivan 2007; Faulkner et al. 2010). The search costs also increase linearly because each beam signal needs to be searched individually; we consider the efficiency gain from using a single processing unit to process multiple beams to be a second-order effect.
For a given search strategy, parametrises the choice of receptor (antenna), the performance, cost and efficiency of the signal combination mode and transients search system, and the observed sky direction. Because the cost of data storage is prohibitive 1 , searches are conducted in real-time and data from candidate events recorded for subsequent verification and analysis. To reduce processing costs, search strategies alternate to those proposed for pulsar surveys can be considered. A pulsar survey need only visit each volume of sky once. In contrast, one volume of sky is considered as likely as another to contain transients (Cordes 2009), and each time a volume is re-visited, there is new detection potential. This independence from sky direction enables lower-cost commensal surveys for fast transients (piggy-back surveys where fast transients are not the primary telescope observation).
Understanding the most effective way to combine the signals from the antennas is also important. A radio telescope array produces images by correlating the antenna signals and averaging the output over a few seconds to reduce subsequent processing costs. To obtain a time resolution of order milliseconds or higher for fast transients detection, alternatives to ‘fast imaging’ are currently required and various signal combination modes must be considered. There are trade-offs between a highly sensitive mode with small FoV (such as the coherent combination of antenna signals) and less sensitive modes which cover more of the sky (incoherent combination, subarraying and ‘fly’s eye’). These trade-offs are also influenced by the spatial density of antennas in the array. Furthermore, additional processing capability enables multiple beams to be formed and searched, re-using the array collecting area to some extent. The event rate per beam also depends on observing frequency and bandwidth.
The SKA is being designed and constructed in two phases, SKA1 and SKA2, where the first is a subset of the second (Garrett et al. 2010). This paper sets out a high-level ‘use case’ for searching for fast transients with SKA1 receptors: low-frequency aperture arrays and low-band dishes (Section 2). It outlines a basis for comparing survey strategies (Section 3) and undertakes a detailed analysis of the effects of receptor choice, signal combination modes, sky direction, observing frequency and bandwidth on the event rate per beam (Sections 4 and 5). These effects are summarised (Section 6) and specific recommendations for SKA1 are made (Section 7). This paper extends the work done by the International Centre for Radio Astronomy Research (ICRAR) for the Commensal Real-Time ASKAP Fast-Transients (CRAFT) survey (Macquart et al. 2010a,b). CRAFT is one of the survey science projects planned for the Australian SKA Pathfinder (ASKAP), a designated SKA precursor instrument. The analysis is applied to the SKA1 system description to determine optimal search strategies, but the method is equally applicable to SKA2 and other radio telescope arrays.
2 Fast Transients Search Use Case
For context, this section outlines a generic fast transients search use case. Because the data volumes are too large to store cost-effectively, the search for transient events is conducted in real-time on a data stream which is a continuous observation of the sky. However, a rolling buffer recording a small period of the data as it is observed allows candidate events containing potential fast transients detections to be saved and further processed off-line. The specific implementation of a fast transients search depends on the target or expected source population and the cost and performance factors of components in the processing pipeline; see Macquart et al. (2010b), Stappers et al. (2011) and Wayth et al. (2011). Figure 1 shows a generic fast transients pipeline and the signal processing steps (actions) in this pipeline are as follows.
Signal reception Radio signals are collected by SKA1 receptors: low-frequency (70–450 MHz) aperture arrays (AA-low) and a dish array equipped with low (0.45–1 GHz) and high (1–2 GHz) band single pixel feeds. Only the low-band feed on the dishes (hereafter referred to as low-band dishes) is discussed in this paper, because it generally achieves a higher event rate than the high-band feed, given the assumptions in Section 4.1.
Station beamforming The complex signals of many AA-low elemental antennas are combined vectorially to form one or more station beams, as shown in Figure 2. The FoV of each station beam is determined by the station diameter and is similar to a single dish of the same diameter. The output station beam can then be processed in the same way as a dish signal. Though not considered here, signals from a group of dishes (instead of elemental antennas) can similarly be coherently combined into dish station beams. Note that both dish and AA stations are simply subarrays of coherently combined antennas.
Signal combination The signals from the dish or station beams can be combined coherently, incoherently or not at all. These signals are then searched for fast transients. Figure 2 shows the signal combination modes and resultant beam patterns; they are further discussed in Appendix A. Incoherent (phase-insensitive) combination sums the detected signals (powers) from antennas pointing in the same direction. Coherent combination of antennas forms a phased or tied array beam – voltages measured at each antenna are aligned in phase towards a specific direction on the sky, in a manner similar to station beamforming. Smaller groups of antennas – subarrays – can be incoherently combined and each subarray pointed in a different direction. The extreme of this is the so-called fly's eye, where every antenna is pointed in a different direction.
Event localisation and the spatial discrimination of astronomical signals from radio frequency interference (RFI) is possible for coherent combination and, using buffered voltages, for incoherent combination and subarrays of three or more incoherently combined antennas. Multiple beams (incoherently or coherently combined) can be used to discriminate RFI, where a candidate event in most or all beams indicates the presence of RFI.
Dedispersion processing The signals pass through a cosmic medium of unknown dispersion measure (DM). This means that the detection needs to be trialled for many DMs, each of which has a computational cost. The DM range to be trialled depends on the location on the sky. Clarke et al. (2011) discusses dedispersion for SKA1 in detail.
Event detection An event-detection algorithm needs to be applied to the signal from each trial DM, where optimal detection is achieved with an appropriate matched filter (Cordes & McLaughlin 2003).
Store in rolling buffer The digitised voltages from the dishes or stations are stored in a circular memory (rolling) buffer. In the case of a candidate event, the data from the buffer can be saved to another location (dumped) and processed off-line. The amount of memory required in the buffer depends on the sampling rate, sample size and the expected maximum (dispersed) pulse duration. The maximum pulse duration is a function of the range of frequencies to be captured and varies linearly with the maximum DM to be trialled. For a maximum DM of 3000 pc cm–3 and a bandwidth of a few hundred MHz, a buffer of order tens of seconds is required for dish frequencies and possibly tens of minutes for lower frequencies.
Buffer dump and off-line processing On receipt of a trigger, the buffer will dump the original voltage data to storage for off-line processing, which could include RFI filtering, analysis of the candidate detection and correlation of the dish or station beams for source localisation and imaging.
Commensal and targeted surveys A commensal survey greatly increases observation time by conducting the survey in parallel with normal telescope operations. It is passive; it uses dish or station beam signals from the primary user observation, placing little extra demand on the telescope. Such a survey is suitable for extragalactic searches, given the information about the population of such fast transients is not known a priori; hence one direction on the sky is as good as another. To observe specific areas of the sky, such as the Galactic plane and nearby galaxies, a targeted transients survey (which is the primary user observation) may be required (e.g. van Leeuwen & Stappers 2010).
Data spigot A data spigot to the dish and station beam signals is useful for transients surveys, especially those which are commensal. If the signal chain is considered to be the signal path from the antennas of a radio telescope array to the correlator, a spigot defines a point in the signal chain where users can tap off data via a well-defined interface. The spigot for fast transient searches may output either coherent (phase-preserved) data at high rates or, alternatively, incoherent data where the dish or station beam voltages are squared and integrated to a time resolution of order milliseconds to reduce the data rate and subsequent dedispersion processing load. The latter approach is being taken by CRAFT to access beams from the ASKAP beamformer (Macquart et al. 2010a). Similarly, searching the integrated signals from the dish or station beams which have been incoherently combined is a low-cost option for commensal surveys with SKA1.
The solid vertical line in Figure 1 shows the point in the flow diagram where the spigot for fast transients would need to exist to enable the signal combination modes in this paper. The pipeline after the spigot point is not part of the normal imaging mode of the telescope; the post-spigot pipeline may be implemented internally or with user-provided processing. An example of processing being implemented internally is the ‘non-imaging processing’ for pulsar observations with SKA1 (Dewdney et al. 2010); the approach being taken by CRAFT is an example of user-provided processing. Note that it is conceivable that a spigot to the AA-low elemental antenna signals could also exist, but the data rates make this option prohibitively expensive for SKA1.
3 Survey Strategy: Maximising Survey Speed and Minimising Cost
Figures of merit quantify the effect of altering the variable parameters of a problem. A simple FoM to measure the cost-effectiveness of a fast transients search strategy is the detected event rate per beam searched , a proxy for cost in the absence of sufficiently accurate design and cost information. We want to optimise for a high value of , although the total event rate for all beams must also be high enough to be of scientific benefit and open new volumes of parameter space. The qualitative advantages of a strategy, which cannot be captured in a FoM, must also be considered.
The detected event rate is effectively a survey speed. Smits et al. (2009) present a frequency-dependent FoM for survey speed (SSFoM) for dishes. It is based on surveying an area of sky, thus SSFoM is linearly proportional to FoV and sensitivity squared. In this paper, the equivalent SSFoM is the rate of transient events detectable in a volume of sky and is linearly proportional to FoV and sensitivity to the power of 3/2. It draws on event rate calculations from Macquart (2011); the derivation is shown in Appendix B.
The event rate is given by
where V max is the maximum volume out to which an object is detectable and Ωproc is the processed FoV, which is the product of the number of beams formed (N beam) and the FoV of each beam. Thus the event rate per beam is
N beam may be the number of station (N b–0) or array (N b–arr) beams. For subarrays, it is the product of the number of subarrays and station beams formed in each subarray (N sa N b–0).
We describe an extragalactic survey as a search for a homogeneously distributed population of isotropically emitting fast transients of fixed intrinsic luminosity. For such a population, the event rate is
where ρ (events s–1 pc–3) is the event rate density, is the intrinsic luminosity of the population, W i is the intrinsic pulse width, W is the observed pulse width and S min is the minimum detectable flux density of the telescope for an integration time of τ = W i. The (W i/W)3/4 term approximates the loss in signal-to-noise (S/N) due to pulse broadening (see Appendix B). The frequency dependence of is discussed in more detail in Section 4.3.
Macquart (2011) shows that the event rate is proportional to Ωproc and S min 3/2 for an extragalactic population with a luminosity distribution which follows a power-law or lognormal distribution. Although the actual event rate depends on the luminosity distribution, the proportionality still holds for Ωproc and S min, which is sufficient to compare telescopes and their signal combination modes. For fast transients searches within the Galaxy, scatter broadening due to multipath propagation in the interstellar medium makes the event rate per beam dependent on frequency and direction. Although comprehensive direction-dependent modelling is beyond the scope of this paper, the loss in sensitivity due to scattering is incorporated in the W term in Equation 3 and modelled for some representative sky directions. This first-order analysis gives an indication of how SKA1 will perform as a function of frequency; see Section 5.3 for further discussion of Galactic objects.
4 Modelling Event Rates
Event rate per beam is a simple metric to model fast transients event rates in the cost-constrained environment of SKA1. This section shows how signal combination mode and filling factor (antenna spatial density) affect for a radio telescope array. The dependence on frequency and the effectiveness of searching large bandwidths are also considered. The modelling is specifically applied to the SKA1 receptors, for source populations whose intrinsic luminosity either does not vary with frequency or varies with ν–1.6, a value typical of the pulsar population (Lorimer et al. 1995). This paper uses the system description of SKA1 (Dewdney et al. 2010) to make trade-offs — the relevant details are given in Table 2. However, the complete system design for SKA1 is still under development and subject to a decision-making process involving trade-offs and performance and cost optimisation.
4.1 Assumptions
The trade-offs in this section make the following simplifying assumptions:
-
The population of fast transients is homogeneously spatially distributed and of fixed intrinsic luminosity.
-
A matched filter is used to detect the dedispersed but scatter-broadened pulse (as per Cordes & McLaughlin 2003).
-
The effects of scintillation on source intermittency and optimum search bandwidth are ignored.
-
The dedispersion processing system does not contribute to pulse broadening (see Appendix C for a description of these instrumental contributions).
-
The intrinsic pulse width is 1 ms. Shorter-duration pulses would be more sensitive to S/N loss due to pulse broadening, longer-duration pulses less sensitive.
-
Events are broad-band such that the intrinsic spectral bandwidth of the pulse is greater than the processed bandwidth. Thus all channels across the band contain contributing signal.
-
The beam has constant (maximum) sensitivity between the half-power beamwidth points, and zero sensitivity outside of that.
-
Beamformer calibration costs are not considered.
-
A time-to-frequency-domain transformation (channelisation) and cross-correlation ‘FX’ correlator is used, as opposed to other correlator topologies such as ‘XF’. It is the most cost-effective architecture for the SKA, and allows other signal-processing actions, such as beamforming and RFI excision, to be done efficiently (Hall et al. 2008).
-
The processing cost of forming and searching a beam is independent of frequency, bandwidth and signal combination modes. In practice, lower frequencies (where the maximum dispersed pulse duration is longer) and larger bandwidths will increase processing costs; the magnitude of the increase is specific to the processing architecture and the effect of sky direction on the DM range to be trialled.
4.2 Signal Combination Mode Comparisons
This section compares the per-beam event rate for incoherent and coherent combination and fly’s eye (Figure 2 and Appendix A) and applies these results to SKA1 AA-low and low-band dishes. We refer to a single dish or AA station as an element, designated with the subscript 0. The combination of an array of N 0 elements may refer to the number of elements in the total array, or some subset of the total array (e.g. the SKA1 core region in Table 2).
Incoherent combination of an array of N 0 elements increases the sensitivity by a factor of over a single element while retaining its FoV, Ω0. Forming N b–0 station beams linearly increases the FoV. To increase the FoV even further, N sa subarrays can be incoherently combined. Each subarray is pointed in a different direction, increasing the FoV by a factor of N sa but only increasing the sensitivity of the array by a factor of over a single element, where N 0/sa is the number of elements per subarray. Fly's eye pertains to the case N 0/sa = 1.
Sensitivity of the coherent combination of an array of N 0 elements is higher than incoherent combination and subarraying; it increases proportional to N 0. However the FoV of the array beam, Ωarr, is much smaller; it is proportional to D arr –2, where D arr is the diameter of the array of elements being combined. The FoV can be linearly increased by forming N b–arr array beams.
Applying these relationships to Equation 3 gives the total event rate for each signal combination mode:
where N pol is the number of polarisations summed, Δν is the processed bandwidth, τ is the post-detection integration time (which also defines the time resolution of the observation), A e0 is the effective area of an element (dish or station), σ is the S/N ratio required for event detection and T sys is the system temperature.
4.2.1 Filling Factor Efficiency
The coherent combination mode is more effective if the dishes or stations are closely spaced, thus having a higher filling factor. In this case, the same number of elements are being combined, but the array beam FoV is larger. D’Addario (2010) considers the number of coherently combined array beams required to achieve an event rate FoM equivalent to one incoherently combined beam for ASKAP. We modify this analysis and apply it to .
Following Cordes (2009), we define the number of pixels (N pix) as the maximum number of independently pointed, coherently combined array beams which can be formed within the FoV of a single element beam. It is frequency independent, and given by
where and are the element and array beam tapers respectively.
A measure of the effectiveness of the coherent combination mode is the number of array beams which need to be formed and searched to achieve a coherent combination event rate equal to incoherent combination . From Equation 4, gives
where η is the fraction of elements in the array which are coherently combined, out of a total N 0. For example, η = 0.5 if only the elements in the SKA core are coherently combined while those in the total array are incoherently combined.
Achieving the highest possible event rate is desirable, but this must be tempered by the cost of searching multiple beams. The relative event rate per beam depends on the array filling factor and is simply the inverse of Equation 6 when N b–0 = 1:
For incoherently combined subarrays (when all N 0 elements are formed into subarrays),
where N sa subarray beams are searched.
4.2.2 Coherent Combination for a Fully Filled Array
Assuming a best case scenario of an array entirely filled with stations of equal diameter and
Substituting into Equation 6 and for η = 1, the theoretical minimum number of array beams required so that is
and the relative event rate per beam (Equation 7) is
This estimation is optimistic towards coherent combination, given that it is not physically possible to entirely fill a circular array with circular stations. Regardless of this, for N 0 > 1, the incoherent combination will always achieve a higher event rate per beam searched than coherent combination, and this difference increases with N 0. These results are frequency independent.
4.2.3 Signal Combination Modes for SKA1
A flexible processing system allows various signal combination modes; Table 3 compares the event rate for incoherent and coherent combination and fly's eye, for SKA1 receptors. The number of array beams required such that is calculated for coherent combination. The event rate per beam formed and searched, relative to is calculated for all modes. Three coherent combination modes are shown: elements in the core region, inner and core region and the total array. Of these, using the elements in the core achieves the highest event rate per beam, due to the higher density of collecting area. For this reason, the core will be the only coherent combination mode further analysed in this paper.
The coherent combination of the AA-low core requires approximately five array beams to equal the event rate of a single beam of the incoherently combined total array. Although the fly's eye mode achieves a higher event rate by a factor of 501/4 = 2.6, the relative rate per beam is less than the coherently combined core. If the dense packing is not achievable (Dewdney et al. 2010), the number of array beams required will be higher. Indeed, using optimisations from Graham et al. (1998), the optimal packing of 25 congruent circles of diameter 180 m in a circle results in a minimum core diameter of 1036 m. This is larger than the 1000 m diameter in Dewdney et al. (2010) and excludes any spacing that may be required for infrastructure.
Due to the lower filling factor, 157 beams formed from the coherent combination of the dishes in the core are required to equal the event rate of the incoherently combined array. The event rate of the fly's eye mode is higher by a factor of 2501/4 = 4 and its relative rate per beam is higher than the coherently combined core. Although not shown, values for subarrays lie between fly's eye and incoherent combination modes, and depend on the number of elements per subarray.
4.3 Frequency Dependence
The event rate has a frequency dependence on luminosity, minimum detectable flux density, FoV and scatter broadening, designated by subscript ν:
It is important to understand this frequency dependence because of the wide fractional bandwidths of the SKA. Looking at each of these dependencies in turn:
-
The processed FoV depends on the number of beams formed and whether they are formed incoherently or coherently, but either way is proportional to ν–2:
-
where c is the speed of light. For incoherent combination, Nbeam is the number of station beams formed, is the feed illumination factor or station beam taper and D is the diameter of the dish or station. For coherent combination, Nbeam is the number of array beams formed, is the array beam taper and D is the diameter of the array (see Appendix A).
-
Pulses are broadened due to scattering. The broadening time τd depends on the path through the Galaxy to the observer, and scales as τd∝ν–4.4 (Cordes & Lazio 2002). The observed pulse duration is given by
-
See Appendix C for further details.
-
Because we are looking for an unknown population, we do not know how the luminosity varies with frequency. However, spectral indices for pulsars have been measured. We use ξ = –1.6 (Lorimer et al. 1995), a value typical of the pulsar population, such that
-
where is luminosity at reference frequency ν0. For comparison, we also consider a flat spectrum population (ξ = 0).
-
Sminν is a function of Tsysν and Ae0ν:
For aperture arrays, the effective area of a station is approximately
The AA-low system temperature is the sum of the receiver noise and an approximation to the sky temperature:
A graphical breakdown of the frequency dependencies of AA-low is shown in Appendix D.
The dependence of event rate on frequency for three representative sky directions and spectral indices of ξ = 0 or –1.6 is shown in Figure 3. The normalised is plotted at 1 MHz intervals for centre frequency ν and processed bandwidth Δν = 1 MHz, spanning the SKA1 system description frequency range of AA-low and low-band dishes (70 MHz ≤ ν ≤ 1000 MHz). The relative event rate per beam between signal combination modes (the rightmost column of Table 3) still applies as a multiplicative factor to the data in Figure 3, regardless of sky direction and frequency.
The simplest case to consider is a search for extragalactic fast transients (solid line in Figure 3). In this case there is no sensitivity loss due to scatter broadening; W » τd is assumed. Cordes & McLaughlin (2003) find that for a given sky direction, the scatter broadening of an extragalactic source will be approximately six times the broadening from the Galaxy alone. This assumes equal scatter broadening in the host galaxy, if there is one, and no contribution from the intergalactic medium. For directions away from the Galactic plane where τd is low and for an intrinsic pulse width of 1 ms, the exclusion of scatter broadening is a reasonable first-order assumption.
A first-order analysis of a search for Galactic transients is possible by invoking the simplifying assumptions listed in Section 4.1. We calculate relative event rates taking into account estimates of scatter broadening at a distance of 30 kpc and frequency of 1 GHz from the NE2001 model of Cordes & Lazio (2002), where the broadening scales as τd∝ν–4.4. A distance of 30 kpc determines the maximum broadening due to interstellar scattering for that direction; broadening is less at shorter distances. Two representative sky directions for Galactic transients are:
-
above the Galactic plane: τd = 0.03 ms (DM = 215 pc cm–3) at l = 300, b = 10
-
on the Galactic plane: τd = 1.78 ms (DM = 628 pc cm–3) at l = 300, b = 0.
For these directions, the normalised in Figure 3 shows how increased scatter broadening reduces the event rate at lower frequencies. In this first-order analysis, the effect of scatter broadening on event rate is independent of the signal combination mode chosen; the frequency-dependent (W i/W)3/4 term in Equation 4 is common to all modes. The factors to consider for a more detailed analysis of Galactic transients searches is outlined in Section 5.3.
From Figure 3, the incoherent combination of dish signals would be most efficient for searching for extragalactic sources with a low spectral indices. For a Galactic population, the preferred receptor depends on the amount of scatter broadening. For directions near or on the Galactic plane, low-band dishes show a higher than AA-low. However, the difference between the two receptors is less than an order of magnitude for steep spectrum sources near the Galactic plane. For steep spectrum sources, even those on the Galactic plane, the higher event rate is at the lower end of the frequency band of each receptor. For shallow spectrum sources near or on the Galactic plane, there is no strong maximum within a receptor frequency band. For dishes, the maximum may be at the low (450 MHz) or high (1 GHz) end of the frequency band, for small or large τd respectively. For AA-low, the event rate quickly reduces for frequencies below νtransition = 115 MHz.
4.4 Large Processed Bandwidths
Processed bandwidth (Δν) is the bandwidth of the astronomical signal at the fast transients detection system. In radio astronomy, an increase in processed bandwidth is usually assumed to produce a increase in signal-to-noise. From Equation 4, this would produce a Δν3/4 increase in event rate. However, unless the event rate is approximately constant across frequency, this increase does not hold for large processed bandwidths.
To calculate the event rate over a large processed bandwidth, the frequency-dependent channel contributions shown in Figure 3 are summed. Where the pulse broadening (due to propagation and instrumental effects) is not significant, the event rate is calculated numerically for a processed bandwidth of Δν = N chΔνch using
where N ch is the number of frequency channels of width Δνch. The derivation is shown in Appendix E. Table 4 makes these calculations for different modes and spectral indices, for the full bandwidth of AA-low and low-band dishes. The relative event rate per beam between signal combination modes of a given receptor (Equations 7 and 8) still hold for large processed bandwidths. Strikingly, for the combined AA-low and low band dish event rate, the contribution from the dishes is only significant for the incoherently combined total array case, when ξ = 0.
Given Δν = N chΔνch, plotting as a function of the number of contributing channels shows the decreasing contribution of higher frequency channels to the event rate. This decreasing contribution results in the event rate increasing by less than Δν3/4. The only exception is in the dense AA regime (<115 MHz) for low spectral indices (–1 ξ ≤ 0), where the decreasing contribution comes from lower frequency channels.
Consider two cases for extragalactic searches, plotted in Figure 4: the AA-low band from 115 to 450 MHz and the low-band dishes where the whole 550 MHz bandwidth is available. The ideal case of a Δν3/4 increase over the Δν = 1 MHz event rate is also shown. As expected, this plot shows that the maximum bandwidth achieves the highest event rate. However, the event rate curve flattens out well before the maximum bandwidth, especially for AA-low and also steeper spectrum sources.
Assuming a limited amount of signal processing is available, the following question arises: at what point can the processing be more effectively used elsewhere in the fast transients pipeline, and how is this quantified? One method is to arbitrarily set a threshold beyond which additional channels contribute very little to the event rate. Channels of increasing frequency are included while the following is true:
For example, say the threshold is set to 0.5%. Then for the ideal case, N ch = 150 channels contribute to . Adding a 151st channel will contribute less than 0.5% to . For the two cases plotted, Table 5 shows the maximum channel for which the improvement in event rate over the rate without that channel is greater than 0.5%.
To interpret this table, compare the maximum contributing channel number when ξ = –1.6. For AA-low, 47 channels (Δν = 47 MHz) contribute above the threshold. For dishes, this is achieved with Δν = 87 MHz, implying that the AA-low processed bandwidth becomes less useful more quickly. This is expected, given the steeper spectral dependence of AA-low over dishes, shown in Figure 3.
5 Discussion
5.1 Effectiveness of Combination Modes
Comparing signal combination modes and array configuration using event rate per beam gives a frequency-independent analysis of the trade-offs for a given receptor. Incoherent combination always achieves a higher than coherent combination, as shown in Section 4.2.2. The difference increases with the number of elements or a reduced filling factor. The per-beam analysis is important because it captures the first-order processing costs for each beam. For this reason, although fly's eye gives the highest total event rate, it comes at the expense of searching many more beams.
However, the per-beam analysis does have some limitations. The total event rate of a search strategy is
where N beam is the number of beams formed and searched. Depending on the signal combination mode used, N beam may be the number of station (N b–0) or array (N b–arr) beams, or N sa N b–0 for N sa incoherently combined subarrays. Electromagnetic and signal processing system design and cost considerations will put maxima on each of these. For example, the number of station beams which can be formed within the antenna FoV is limited by performance degradation, the diameter of the station and the beamforming processing power available. Once the limit on forming more station beams is reached, will eventually become higher for coherent rather than incoherent combination. This is because N pix array beams can be formed within each of the N b–0 station beams (Equation 5), allowing a maximum of N pix N b–0 array beams to be formed.
Dishes with single-pixel feeds only have a single beam (i.e. N b–0 = 1), so of the incoherent combination of dishes cannot be increased by forming more beams. However, if phased array feeds (PAFs) are available on dishes, then N b–0 dish beams can be formed, increasing the total event rate.
So we must consider: do we want to form and search more array beams, subarrays or incoherently combined station beams? This will be influenced by station beamforming (for AAs) and array beamforming costs, search costs and the array filling factor. Although exact costs are unavailable, we can generalise preferred combination as follows:
-
Incoherent combination Achieves the highest event rate per beam, making it preferable in most cases. If the cost of searching a beam signal is high, incoherent combination presents a further advantage over other modes.
-
Coherent combination Requires an array with a high filling factor and low beam-search costs; it can achieve the highest total event rate if many array beams are formed and searched. For AA-low, the cost of forming multiple coherently combined array beams must be lower than the cost of forming and incoherently combining multiple station beams.
-
Incoherently combined subarrays and fly's eye This mode is only preferable when the beam search cost is low. A fly's eye mode excludes the buffering and source localisation advantages of an array, and commensality with most observations (see Section 2). Sets of three-element subarrays counter this problem and could employ antennas unused by the primary user observation. For example, splitting 24 AA-low stations outside the core into 8 of these three-element subarrays results in a total event rate approximately equal to the incoherent combination of 50 stations.
The SKA1 AA-low results show a weak preference for the incoherent combination of station beams: five array beams formed from the coherent combination of the AA-low stations in the core are required to equal the event rate of a single beam of the incoherently combined total array, assuming the stations are very closely packed. If multiple station beams are formed ‘at no cost’ for normal (imaging) array observing, or the cost of array beamforming is high, the advantage of incoherent combination is increased. In practice, other effects such as RFI mitigation and station and array beam quality also need to be taken into account.
For SKA1 low-band dishes, 157 array beams from the coherent combination of dishes in the core are required to equal the event rate of the incoherently combined total array. Coherent combination with dishes would only be optimal for hundreds of array beams, and even then it is likely that the processing power would be more effectively spent on AA-low. However, an incoherent commensal survey with low band dishes would be a cost-effective method to cover parameter space.
5.2 Frequency and Bandwidth Effects
The relative event rate per beam between signal combination modes described in the previous section is independent of frequency for searches of extragalactic populations and the first-order analysis of Galactic populations. The frequency-dependent effects in Equation 4 are common to all modes; the exception being processed FoV, however that cancels when the relative rate is calculated.
For a given signal combination mode, the event rate per beam changes with frequency, as shown in Figure 3. For the extragalactic case (or more generally, when the observed pulse width is much greater than the scatter broadening), the slope on the dish event rate is due to the spectral index of the source and changing FoV; for aperture arrays, the frequency dependence of T sys (due to sky noise) and A e is also a factor. Additionally, some pulsars display a turnover (a break in the spectrum where pulsar brightness is maximum) around 100–200 MHz (Malofeev et al. 1994), and others above 1 GHz (Kijak et al. 2011). The event rate for such sources would decrease below the turnover frequency. For the Galactic case, the scatter broadening increasingly reduces the S/N for lower frequencies, in addition to the other factors described above.
For SKA1, is generally higher for AA-low than dishes, however the opposite is true for sources with low spectral indices and directions of larger scatter broadening. Also, lower frequencies have an increased memory cost for dedispersion. When low spectral indices, increased scatter broadening or low-frequency turnovers are factors and only a few AA stations or array beams are available, the incoherent combination of the low-band dishes gives a higher . If SKA1 dishes are equipped with PAFs, this would increase the total incoherent combination event rate of the dish array, as multiple station beams does for AA-low.
The frequency and bandwidth effects on event rate per beam are interdependent. Section 4.4 shows that an optimal frequency range for searching for fast transients with SKA1 may be smaller than the full band of the receptor. As the frequency increases, the FoV reduces and the source luminosity is expected to decrease for sources with pulsar-like emission characteristics. Processing more channels (hence bandwidth) increases , however this increase is less than Δν3/4.
For some threshold beyond which extra bandwidth contributes little to the event rate, the processing for channels above this threshold could be more effectively used to form and search extra beams (increasing through FoV), trial more DMs or increase the detection S/N (sensitivity) through more optimal dedispersion techniques. The threshold depends on the costs of forming and searching the beams, which increases with bandwidth. Also, the number of channels (hence bandwidth) required to reach this threshold reduces for steeper spectrum sources (see Table 5). For searches of populations where the event rate is increasing with frequency, such as directions of large scatter broadening, the first channel is at the highest frequency and further contributing channels come from lower frequencies.
Even if the search does not need the full band, recording it in a buffer is desirable to enable a more powerful analysis of detected transients with dedicated processing – this is especially important for relatively rare events. For flat spectrum sources, the S/N improves by a factor of . The extra spectral information is also a useful analysis tool. For example, the multipath propagation which causes scatter broadening would be more evident with a larger bandwidth, given the ν–4.4 relationship in Cordes & Lazio (2002), allowing us to distinguish between the intrinsic pulse width and a pulse that has been scatter-broadened.
5.3 Application to Galactic Populations
A search for a Galactic population of fast transients is dependent on observing direction. The first-order analysis of scatter-broadening shows that the event rate varies as a function of observation direction and intrinsic pulse width. The dependence of event rate on frequency, described previously, still applies. For an intrinsic pulse width of 1 ms, low-band dishes give a higher event rate than AA-low for directions on or near the Galactic plane where the scatter broadening is large. Although the effectiveness of AA-low is reduced at these directions, the increased luminosity of steep spectrum sources at lower frequencies somewhat counters this effect. Observations using the high-band feed (1–2 GHz) on dishes are not modelled here, but in areas closer to the Galactic centre, where scatter-broadening is increased, it would achieve an event rate higher than the low-band feed for spectrally shallow sources and sources with smaller intrinsic pulse widths.
The event rate per beam curve for populations near the Galactic plane (Figure 3) is quite flat in comparison to extragalactic populations, especially for shallow spectrum sources. Because of this, additional processed bandwidth is more useful than for extragalactic populations, although it still cannot increase the event rate by more than the ideal case of Δν3/4. For the threshold discussed in Section 4.4, more channels would contribute to the event rate.
Besides scatter-broadening, another limit on the event rate occurs if the telescope is sensitive enough to observe the population to the edge of the Galaxy (i.e. the sources are luminous enough to be observable to the edge of the Galaxy). In that case, the event rate for a sensitivity-limited volume (as used in this paper) is not valid, because the limit is instead imposed by the boundary of the Galaxy. Macquart (2011) captures both of these effects by introducing a direction-dependent factor δ, such that
where 0 ≤ δ ≤ 3/2. A key result in Macquart (2011) is that FoV is more strongly preferred over sensitivity when scatter broadening or volume boundary limits increase δ. Further modelling of the directional dependence of δ is shown in that paper.
The total number of events detected in a transients survey is proportional to , where t is the total observation time. When δ is large, the contribution from S min is lessened. In this case, the total number of events detected can be more effectively increased by forming multiple beams (if available), incoherently combining the array or subarray signals (gaining a much larger FoV than coherent combination) or simply spending more time observing the sky. A low-cost commensal survey using the incoherently combined array achieves these, making it even more effective for populations near the Galactic plane.
To determine the number of events detectable for a Galactic population, simulations must account for all frequency-dependent effects on as a function of sky direction. Simulations for LOFAR pulsar searches (van Leeuwen & Stappers 2010) account for T sky (which modifies telescope sensitivity) as a function of direction and scatter-broadening as a function of direction and distance. For a population with a distribution of luminosities, the event rate is distance-dependent. By numerically integrating along each line of sight, Macquart (2011) accounts for sources at each distance step which can no longer be detected below a certain sensitivity due to scatter-broadening. Such an analysis is beyond the scope of this work, and requires assumptions to be made about the luminosity and spatial distribution of the objects, and their intrinsic pulse widths.
5.4 Further Work
Maximising involves trade-offs between the receptor, signal combination mode, observing frequency and bandwidth for a given sky direction. This first-order analysis assumes . The next logical step is to describe the cost of forming and searching beams, to maximise . This problem is complex because beamforming and dedispersion processing costs are architecture-specific. However, modelling these costs will give a better understanding of the optimal bandwidth and frequency to use for a given survey strategy and sky direction. Areas of work to further maximise are:
-
Trade-offs between station size and number of stations, for fixed AA-low collecting area.
-
The S/N gain versus processing cost from using a more accurate incoherent dedispersion method.
-
The S/N gain versus processing cost of using coherent dedispersion. Incoherent dedispersion has a lower S/N ratio than coherent dedispersion, but requires significantly less processing. To maintain the same detection rate as coherent dedispersion, how many more beams need to be formed and searched?
6 Conclusions
Radio telescopes are inherently flexible, giving rise to a large design trade-off space, but any trade-offs must be made with the consideration of cost. This paper presents a new figure of merit, event rate per beam , to measure the effectiveness of a survey strategy in detecting transient events in a volume of sky, while considering first-order costs — the ultimate goal being to calculate the event rate per unit cost . The results show the complexities in determining an optimal receptor, signal combination mode and frequency range for a given array. They highlight some important points for searching for fast transients with SKA1 and more broadly with radio telescope arrays.
The per-beam event rate enables a frequency-independent analysis of the optimal signal combination mode and array configuration for a given receptor. This analysis is applied to SKA1, but regardless of the telescope used, incoherent combination always achieves a higher , making it preferable to coherent combination, subarraying and fly's eye in most cases. This is due to the event rate for surveying a volume of sky increasing faster with FoV than with sensitivity. The advantage of incoherent combination increases with a lower filling factor or the combination of an array with more elements. The exception to this result can occur once the number of dish or station beams formed reaches its maximum, due to physical and processing constraints. In this case, forming more array beams can result in coherent combination achieving a higher total event rate than incoherent combination. Fly’s eye is unattractive because of the need to search many more beams, and its lack of localisation capability. However, if the search cost is low, incoherently combined three-element subarrays could usefully employ antennas unused by the primary (e.g. imaging) user observation. An example with SKA1 would be splitting 24 AA-low stations outside the core into 8 three-element subarrays, to achieve a total event rate similar to the incoherent combination of the total array.
From Section 4.2, the per-beam event rates for incoherent and coherent combination can be compared using
where N 0 is the number of stations or dishes used for incoherent combination, η is the fraction of N 0 used for coherent combination and N pix is the number of coherently formed array beams required to fill the FoV of the station or dish beam. This reduces to
for the ideal densely packed case, although the ratio is actually higher due to physical limitations. If the N 0 antennas are divided into N sa subarrays,
Dependence on observing frequency and bandwidth needs to be considered when making event rate calculations, as do the observation direction and expected spectrum of the source. The results in this paper show that for commensal observing, where the direction of observation is determined by the primary telescope user, a fast transients pipeline must dynamically adjust the search strategy to achieve the highest event rate. For a targeted survey, the choice of receptor needs careful consideration.
For extragalactic searches with SKA1, the full available bandwidth does not need to be searched; it is not an optimal use of the processing system. The contribution to event rate from processing additional bandwidth decreases as the signal frequency increases. Beyond a threshold, the processing could be more effectively used to form and search more beams, trial more DMs or increase the detection S/N ratio. However, the full band is desirable for analysis of detected transients and for searches in directions where scatter broadening is such that the event rate is approximately constant with frequency.
7 Recommendations for SKA1
-
SKA processing needs to provide flexible search modes The preferred receptor and signal combination mode depends on direction, especially for Galactic populations. Incoherent and coherent combination modes are both effective with SKA1 and depend on the array filling factor, signal processing costs and the spectrum of the source population.
For SKA1, coherent combination of the AA-low core requires approximately five array beams to equal the event rate of a single beam of the incoherently combined total array. For low-band dishes, this number is 157, making the coherent combination of dishes unattractive unless more than this many array beams can be formed.
SKA1 AA-low is effective for extragalactic transients searches (assuming dedispersion costs are not too high), especially for steep spectrum sources (see Figure 3). For flat-spectrum sources, the advantage of aperture arrays disappears. Low-band dishes give a higher event rate for directions near the Galactic plane, where scatter broadening is larger.
-
The re-use of signal processing required for SKA imaging modes enables low-cost fast transients searches Commensal surveys using the incoherent combination of dish signals or AA station beams are low-cost options for searching for fast transients with SKA1. The incremental cost of implementing such a search is small because it uses beams formed for the primary user observation. A commensal survey using both receptors is a simple method to increase the total number of events detected, by increasing the total observation time. The survey effectiveness, compared to the coherent combination of the array, increases for populations closer to the Galactic plane.
Access to the dish and station beam data should be via a spigot with a well-defined interface, to enable the implementation of flexible search modes. The processing for commensal surveys could be implemented internally or with user-provided processing as it becomes affordable or available.
-
Requirements for fast transient searches with SKA1 Until the processing costs are further explored, the base requirements for SKA1 are:
-
Availability of incoherent and coherent combination modes for AA-low and low band dishes.
-
Processing for low-cost commensal survey modes; or provision for access to the dish and AA station beam data via spigots.
-
Voltage (coherent) buffering capability of the full band; of order tens of seconds for dish frequencies and possibly minutes for lower frequencies.
For extragalactic searches, processing the full available bandwidth is not required. Bandwidths of 50–100 MHz are sufficient on the basis of the simplified investigation undertaken in this paper; a more detailed study of the trade-offs could be made. However, buffering the full band is desirable for analysis of detected transients.
-
Acknowledgements
The authors thank R. D. Ekers, P. J. Hall, J.-P. Macquart and S. Ord for discussions and suggestions. The International Centre for Radio Astronomy Research is a Joint Venture between Curtin University and the University of Western Australia, funded by the State Government of Western Australia and the Joint Venture partners. T. M. Colegate is a recipient of an Australian Postgraduate Award and a Curtin Research Scholarship.
References
Burke-Spolaor, S. and Bailes, M., 2010, MNRAS, 402, 855| Crossref | GoogleScholarGoogle Scholar |
Chippendale A. P. , Colegate T. M. & O'Sullivan J. D. , 2007, SKA Cost: A Tool for SKA Cost and Performance Estimation, SKA Memo 92
Clarke N. L. , D'Addario L. , Navarro R. , Cheng T.-H. & Trinh J. , 2011, An Architecture for Incoherent Dedispersion, CRAFT Memo 6
Cordes J. M. , 2009, The SKA as a Radio Synoptic Survey Telescope: Widefield Surveys for Transients, Pulsars and ETI, 14th edn., SKA Memo 97
Cordes J. M. & Lazio T. J. W. , 2002, arXiv:astro-ph/0207156
Cordes, J. M. and McLaughlin, M. A., 2003, ApJ, 596, 1142
| Crossref | GoogleScholarGoogle Scholar |
Cordes, J. M., Lazio, T. J. W. and McLaughlin, M. A., 2004, NewAR, 48, 1459
| Crossref | GoogleScholarGoogle Scholar |
Cordes, J. M. et al., 2006, ApJ, 637, 446
| Crossref | GoogleScholarGoogle Scholar |
D'Addario L. , 2010, ASKAP Surveys for Transients: Which Observing Mode is Best?, SKA Memo 123
Deneva, J. S. et al., 2009, ApJ, 703, 2259
| Crossref | GoogleScholarGoogle Scholar |
Dewdney P. E. , bij de Vaate J.-G. , Cloete K. , Gunst A. W. , Hall D. , McCool R. , Roddis N. & Turner W. , 2010, SKA Phase 1: Preliminary System Description, SKA Memo 130
Faulkner A. J. , et al., 2010, Aperture Arrays for the SKA: the SKADS White Paper, SKA Memo 122
Garrett M. A. , Cordes J. M. , Deboer D. R. , Jonas J. L. , Rawlings S. & Schilizzi R. T. , 2010, A Concept Design for SKA Phase 1 (SKA1), SKA Memo 125
Graham, R. L., Lubachevsky, B. D., Nurmela, K. J. and Ostergard, P. R. J., 1998, Disc. Math., 181, 139
| Crossref | GoogleScholarGoogle Scholar |
Hall P. J. , Schilizzi R. T. , Dewdney P. E. & Lazio T. J. W. , 2008, The Radio Science Bulletin, 326
Hankins T. H. & Rickett B. J. , 1975, in Methods in Computational Physics, ed. B. Alder, S. Fernbach, & M. Rotenberg, Vol. 14, 55–129
Hessels J. W. T. , Stappers B. W. & van Leeuwen J. , & Transients Key Science Project, 2009, in The Low-Frequency Radio Universe, ed. D. J. Saikia, D. Green, Y. Gupta. & T. Venturi
Keane E. F. , Kramer M. , Lyne A. G. , Stappers B. W. , & McLaughlin M. A. , 2011, MNRAS, 415, 3065
Keith, M. J. et al., 2010, MNRAS, 409, 619
| Crossref | GoogleScholarGoogle Scholar |
Kijak, J., Lewandowski, W., Maron, O., Gupta, Y. and Jessner, A., 2011, A&A, 531, A16
Lorimer, D. R., Bailes, M., McLaughlin, M. A., Narkevic, D. J. and Crawford, F., 2007, Sci, 318, 777
| Crossref | GoogleScholarGoogle Scholar | 1:CAS:528:DC%2BD2sXht1elsrnL&md5=73b8ab97e4ea5b3bcafe64c2b11dd18dCAS |
Lorimer, D. R., Yates, J. A., Lyne, A. G. and Gould, D. M., 1995, MNRAS, 273, 411
Macquart J.-P. , Hall P. J. & Clarke N. , 2010a, in ‘International SKA Forum 2010 Science Meeting’ PoS(ISKAF2010)03, Assen, the Netherlands
Macquart, J.-P. et al., 2010b, PASA, 27, 272
Macquart, J.-P., 2011, ApJ, 734, 20
| Crossref | GoogleScholarGoogle Scholar |
Malofeev, V. M., Gil, J. A., Jessner, A., Malov, I. F., Seiradakis, J. H., Sieber, W. and Wielebinski, R., 1994, A&A, 285, 201
Siemion A. , et al. 2011, in BAAS, Vol. 43, American Astronomical Society Meeting Abstracts #217, 240.06
Smits, R., Kramer, M., Stappers, B., Lorimer, D. R., Cordes, J. and Faulkner, A., 2009, A&A, 493, 1161
Stappers, B. W. et al., 2011, A&A, 530, A80
van Leeuwen, J. and Stappers, B. W., 2010, A&A, 509, A7
Wayth R. B. , Brisken W. F. , Deller A. T. , Majid W. A. , Thompson D. R. , Tingay S. J. & Wagstaff K. L. , 2011, ApJ, 735, 97
Wilkinson, P. N., Kellermann, K. I., Ekers, R. D., Cordes, J. M. and Lazio, T. J. W., 2004, NewAR, 48, 1551
| Crossref | GoogleScholarGoogle Scholar |
Zarb Adami K. , Faulkner A. , bij de Vaate J. G. , Kant G. W. & Pickard P. , 2010, in Phased Array Systems and Technology (ARRAY), 2010 IEEE International Symposium on, 883–890
1 For example, ~20 GB per station beam per second would need to be written to storage for SKA1 AA-low (380 MHz bandwidth × 2 Nyquist × 4 bits × 50 stations).
Appendices
A Signal Combination Techniques
The performance attributes of a radio telescope array depend on how the signals from the array elements are combined. This in turn affects the detection rate for fast transients. The array elements may be an antenna (such as a dish or dipole) or a phased group of antennas (stations). We term the single antenna primary beam or the phased station beam as the element beam, with FoV Ω0. The signals from these elements may then be combined incoherently or coherently as discussed later in this section.
A.1 AA Station Beamforming
An aperture array station will have thousands of individual antennas. To reduce the data rate from the station and the downstream signal processing load, the antennas will be phased into a station beam, as described in Zarb Adami et al. (2010). Multiple stations beams can be formed by applying the appropriate phase shift. The beamformer processing cost scales as N a/st N b–0Δν operations per second, where N a/st is the number of antenna elements per station, N b–0 is the number of station beams formed and Δν is the bandwidth (Cordes 2009).
A.2 Modes of Beamforming for Searching
For antenna, phased-array feed or station beams pointing at the same location on the sky, the signals detected may be combined incoherently or coherently. The following tables show how sensitivity, FoV, beamformer processing cost and the number of data streams to be searched scale for different signal combination modes. The scaling equations assume that the polarisations are summed prior to searching and all elements are of equal diameter and sensitivity.
A.2.1 Incoherent Combination
Incoherent combination of the element signals requires the signal from each element to be detected, a geometric delay applied and the signals summed. Figure 5 shows the steps to incoherently combine signals and Table 6 shows some performance attributes.
As long as the appropriate geometric delay is applied to the signal at each element, incoherently combined elements do not need to be located close together. For the SKA, this means that while the core is being used for low angular resolution experiments, the mid and long baselines could be used for fast transient searches. Note that incoherent combination cannot account for the geometric delays within the beam, but away from the beam centre.
We assume that digitising, channelising and station beamforming are existing telescope functions and that these do not factor into the additional processing cost of incoherently combining the station beams. Square-law detection involves squaring and summing the real and imaginary components of each channel of each beam from each station. The averager integrates the power samples for each beam, and then equivalent beams from different elements are summed together, resulting in a total of N b–0 incoherently combined data streams. The integration of the power samples greatly reduces the data rate of the incoherent beams, although it comes at a cost of a lower time resolution.
A.2.2 Independently Pointed Subarrays, Incoherently Combined
A further increase in FoV can be achieved by pointing subarrays of elements in different directions and incoherently combining the signals from elements in the subarray. Table 7 shows performance attributes for this mode. The processing cost for incoherently combining subarrays is approximately the same as it is for incoherently combining all stations because the same number of beams need to be square-law detected and summed across stations; the difference is that separate sums need to be maintained for each subarray. The approximate beamforming operations cost is therefore independent of the number of incoherently combined subarrays (N sa).
The smallest subarray size is one element, meaning each element is pointing to a unique patch of sky — this is termed fly's eye. In reality, the minimum number of elements in a subarray would be 3, to allow for a triggered buffer to be used to localise any detected signal.
A.2.3 Coherent Combination — Array Beamforming
For coherent combination, each array beam is formed by the weighted sum of N 0 element beams pointing in the same direction. Geometric delays are applied to the signals from the elements, which are then summed and detected. Like AA stations, multiple beams can be formed. Figure 6 shows the steps for coherent array beamforming, and Table 8 shows the performance attributes. Cordes (2009) discusses array beamforming in the context of the SKA.
A.2.4 Correlation Beamforming — ‘Fast Imaging’
An alternative method to pixelise the full FoV is to create images from the correlator at a high time resolution. The highest time resolution available is specified by the correlation integration period, which must be short enough to prevent smearing. Cordes (2009) compares the computational cost of this method with the coherent sum, and finds that the cost-effectiveness depends on configuration and system temperature. However the major advantage is that the correlator hardware can be re-used. It should be noted that this means that the data must be able to be dumped from the correlator at these high rates which may place additional requirements on the correlator architecture. There is also a computational cost for gridding and imaging at high time resolution.
B Event Rate for a Broadened Pulse in a Volume of Sky
The detected event rate is calculated for observable space. We assume extragalactic sources of intrinsic luminosity are homogeneously distributed in a sphere of volume V with a nominal event rate density of ρ events s–1 pc–3. The event rate for this sphere is given by ρV:
where D is the radius of the sphere in pc. From Deneva et al. (2009), a source can be detected out to a maximum distance
where S i is the intrinsic flux density of the source.
The intrinsic flux density of a pulse differs from the observed flux density due to pulse broadening (or smearing) effects of the interstellar medium and of the detection system itself. However, if no energy is lost due to attenuation, then pulse ‘area’ is conserved such that S i W i = SW, where W i is the intrinsic width of the pulse, S is the observed flux density and W is the width of the broadened pulse. If the post-detection integration time τ is equal to W, a telescope with minimum detectable flux density S min,τ = W can detect the broadened pulse to a maximum distance
A more general relationship which uses an integration time of τ = W i is
Pulse-broadening factors are discussed further in Appendix C.
The nominal extragalactic population is observable out to D max for the fraction of the sky observed (Ωproc/Ωsky) and the detected event rate is
Ωproc, S min and W are functions of how the signals are combined and processed by the telescope system. S min = σΔS rms, where σ is the required S/N ratio and ΔS rms is the rms variation in flux density for a randomly polarised source, obtained by applying the telescope gain to the radiometer equation:
where T sys is the system temperature, A e is the total effective area used in the survey, Δν is the processed bandwidth and τ is the post-detection integration time, which also defines the time resolution.
C Pulse Broadening and Correction (dedispersion)
Cordes & McLaughlin (2003) model the broadening of a delta-function pulse due to propagation through the interstellar medium and signal-processing response times using the following approximation:
where Δt DM represents the broadening due to dispersion smearing; Δt δDM is due to the error in the DM, δDM, used by the system's dedispersion signal processing; Δt Δν is the system's filter response time; and τd is due to the multipath propagation effects of the medium. Expanding on this, the broadening of a pulse of intrinsic width W i can be modelled as
The filter response time, Δt Δν, is approximately equal to Δν–1, where Δν is the bandwidth of the filtered signal. It is important to note that for a fully coherent transient detection system where the element beams are coherently combined, coherently dedispersed and searched, Δν represents the full signal bandwidth; whereas for an incoherent transient detection system in which the signal is channelised, detected and searched, Δν represents the much smaller channel bandwidth. Consequently, the filter response component of pulse broadening, Δt Δν, is significantly higher for incoherent transient detection systems.
Furthermore, while coherent dedispersion techniques can completely correct for dispersion smearing (given that the DM is known), incoherent dedispersion techniques can only correct for dispersion between the filter-bank channels; they cannot correct for dispersion within the channels. Intra-channel dispersion smearing can be reduced by choosing smaller channel bandwidths, but at the expense of larger filter response times. The optimum channel bandwidth for incoherent dedispersion occurs where the dispersion smearing within each channel equals the filter response time (Hankins & Rickett 1975; Cordes & McLaughlin 2003). This leads to a minimum pulse width after incoherent dedispersion of
where . It should be noted that this optimum cannot be realised for all DMs, because it expects the channel bandwidth to be a function of the DM.
For coherent dedispersion the Δt DM term is completely removed:
where (as noted above) Δν is the full processed bandwidth and Δt Δν ≈Δν–1.
D Frequency Dependence of Low-Frequency Aperture Arrays
For illustrative purposes, we plot the event rate and a breakdown of its frequency-dependent components. Figure 7 shows for source luminosities with spectral indices ξ = –1.6 and 0, over a frequency range 70–450 MHz, at 1 MHz steps with processed bandwidth Δν = 1 MHz and normalised to at 70 MHz. The slope is steep: at 160 MHz, the event rate is 10% of the event rate at 70 MHz. At 450 MHz, the event rate is 0.016% of the 70 MHz event rate. A further breakdown of S min ν is shown in Figure 8.
E Event Rate as a Function of Frequency
The event rate is determined numerically by calculating the S/N ratio (SNR) for each of N ch frequency channels of width Δνch, weighting it by w, the amount of sky seen with that SNR, and then taking the root of the sum of the squares:
where is a constant. Given
Setting and so , we can say that
for a processed bandwidth of Δν =N chΔνch. Where there is no frequency dependence, the event rate for total bandwidth Δν and unit bandwidth Δνch = 1 becomes
as expected.