1. Introduction
In physical space, pointing interactions cannot rely on methods like cursors, rays, or virtual hands for pointing feedback as in virtual space. Instead, they must depend on the user’s senses and experience to capture targets. When users perform pointing-based target capture in physical space, their hands lack physical support [
1], and physiological tremors of the body [
2] hinder accurate target selection. During the process of pointing selection, users must also maintain the stability of the pointing device until confirmation of selection, such as pressing a button on a remote control or pulling a trigger. Such confirmation actions may alter the pointing direction, resulting in a deviation between the actual selected position and the target position, known as the Heisenberg effect [
3,
4]. The Heisenberg effect in human–computer interaction can have certain impacts. For example, hand-based pointing is susceptible to small tremors that cause deviations from the target [
5]. Similarly, discrete inputs, such as pressing a button, can disrupt the position of the interaction device, leading to spatial Heisenberg effects and introducing errors during the interaction process [
4]. When the target is far from the user, the stability of pointing selection is typically significantly affected [
6]. Interfering targets in physical space can also influence the accuracy of user pointing and selection.
Currently, research on pointing interaction technology primarily focuses on the VR/AR domain. It involves using handheld controllers to emit lasers into virtual scenes and select the first object intersecting with the beam [
7,
8,
9,
10]. Due to its simple and efficient characteristics, many studies are based on this pointing interaction technology [
11,
12,
13,
14]. Research on pointing interaction technology in physical space is relatively limited, mostly focusing on display screens, graphical interfaces, and pointing interaction technologies with cursor-based interfaces [
15,
16,
17,
18,
19,
20].
Over the years, researchers have also explored the distribution of endpoint selections. Murata observed that the distributions of horizontal and vertical coordinates of endpoints approximate a Gaussian distribution [
21]. Recently, Grossman et al. developed a probability model for two-dimensional rapid pointing tasks, showing that deviations in both directions increase with target distance [
22]. Bi et al. studied the distribution of touch input endpoints and proposed a dual-distribution hypothesis, deriving FFitts’ law based on endpoint distribution to predict finger touch performance [
23]. Yu et al. explored endpoint distribution models for pointing selections in VR environments, introducing the EDModel [
11]. Wei et al. proposed a prediction of endpoint distribution based on the eye and head [
24].
Currently, research on region partitioning algorithms in pointing interaction techniques is relatively sparse. Most pointing interaction methods initiate interaction only upon complete target selection. However, this approach requires targets to be strategically placed and may not adequately meet the distribution requirements for multiple targets. In complex pointing scenarios involving multiple targets, region partitioning algorithms often face issues with overlapping partitions. Therefore, there is a need to segment overlapping regions. Research in this area generally falls into two categories: region segmentation based on image recognition [
25,
26] and region partitioning based on Voronoi diagrams [
26,
27,
28,
29]. Voronoi diagrams can be applied in spatial distribution analysis [
30,
31,
32], trajectory planning [
33,
34], and coverage in sensor networks [
35]. Miah et al. proposed a region coverage optimization algorithm based on Voronoi diagrams [
36]; Li et al. utilized Voronoi diagrams for area division of event pixels and neighborhood construction [
37]; Li and Chen proposed a Voronoi-based method to solve the coverage problem of unknown deformable targets [
38]; Afghantoloee et al. introduced a purpose-oriented 3D Voronoi algorithm for optimizing sensor deployment in complex environments [
39].
In response to the aforementioned issues, we propose two target distribution models for pointing interactions in physical space: the single-target pointing endpoint distribution model (ST-PEDM) and the multi-target pointing endpoint distribution model (MT-PEDM). The ST-PEDM predicts the distribution of endpoint tasks for pointing selections based on target distance, visual error, and the Heisenberg effect. The MT-PEDM can predict the distribution areas of endpoint tasks for pointing selections in complex multi-target scenarios. Based on these two models, we propose a basic region partitioning algorithm (BRPA) and an enhanced region partitioning algorithm (ERPA). These algorithms exhibit high accuracy and efficiency, enabling precise determination of user intentions during pointing interactions. It can be applied to the rapid capture control and trajectory planning of drone swarms. Users can quickly and accurately capture and control target drones using pointing interaction technology, issue commands, and transmit data through smart glasses, thereby achieving effective drone control and trajectory planning, as shown in
Figure 1.
2. Models and Algorithms
The endpoint refers to the coordinates where the directional ray emitted from a handheld pointing device intersects with the plane of the target control device during user interaction. These endpoints are typically distributed around the target in a scattered pattern.
2.1. Single-Target Pointing Endpoint Distribution Model
The pointing selection task is divided into two stages: pointing and confirmation of selection. Previous studies [
7,
40] have shown that, during the pointing stage, factors such as directional tilt and physiological tremors lead to variability in endpoint locations, resulting in a two-dimensional normal distribution of endpoints. The discrepancy between actual and perceived endpoints due to inconsistent gaze and pointing directions affects the expected endpoint distribution model
. The probability density function of the two-dimensional normal distribution of pointing endpoints is given by Equation (
1), where
p is the correlation coefficient between
x and
y, and
,
,
, and
are the mean and standard deviations of the distributions of variables
x and
y, respectively.
The selection confirmation stage is also influenced by the Heisenberg effect, impacting the distribution of endpoints. The target distance
D, the Heisenberg effect, and their interactions affect the parameters of the two-dimensional average model for endpoint distribution, including the expectation
and the covariance matrix ∑, as shown in Equation (
2).
When engaging in directional interactive operations, as the target distance
D increases, the influence of selection error and physiological tremor on the range of endpoint distribution is magnified, thereby affecting the covariance matrix ∑. The impact of visual error interaction on the offset of the expectation µ of the endpoint distribution model is also amplified. Based on this, we propose an endpoint distribution model without the Heisenberg effect, as shown in Equation (
3), where constants
and
h are determined empirically.
The single-target endpoint distribution model with the Heisenberg effect is shown in Equation (
4), where constants b, c, d, m, n, o, p, q, and r are determined empirically.
We propose an experimental model based on target distance D, the Heisenberg effect, and visual error. Simultaneously, we explore the impact of four parameters on the endpoint distribution and predict the expectation µ and covariance matrix ∑ of the two-dimensional normal distribution through target distance.
2.2. Multi-Target Pointing Endpoint Distribution Model
Based on the derivation from the ST-PEDM, we propose a model for the endpoint distribution based on the distance between the pointing target and the interfering target and the direction of the interfering target.
In real-world scenarios, users typically do not separate selection confirmation from pointing tasks, leading to the Heisenberg effect during pointing tasks. Building upon Equation (
4), we derive the endpoint distribution model for pointing towards targets with interference.
The distance between the interfering target and the actual selection target significantly influences the endpoint distribution. When the distance between them reaches a critical value, the impact of the interfering target on the endpoint distribution becomes negligible. For interfering targets at different azimuths, the endpoint distribution shifts in the opposite direction from the interfering target.
The theoretical foundation of multi-target and single-target pointing tasks is similar. Building upon Equation (
4), we derive horizontal Equation (
5) and vertical Equation (
6) expressions for the positions and motion directions of horizontally and vertically interfering targets, respectively:
Integrating formulas for horizontal and vertical directions yields a model formula, as in Equation (
7). For the distribution of pointing endpoints in multi-target scenarios.
And, by extending Equation (
7) to multiple interfering targets, we derive Equation (
8).
are empirical constants, and
is the angle between the line connecting the pointing target and the interfering target and the horizontal axis. We propose an experimental model based on the distance and direction of interfering targets, simultaneously exploring the influence of two parameters on the endpoint distribution model. Additionally, we predict the expectation
of the two-dimensional normal distribution through variations in target distance.
2.3. Region Partitioning Algorithm
Based on the target distribution model of multi-target interference, two region partitioning algorithms, BRPA and ERPA, are proposed, respectively, for the single target direction selection scene and the complex multi-target selection scene. The area where the target is located is divided, and the user’s pointing and selection intention are judged. When the pointing endpoint falls in the divided area, the user is determined to establish interaction with the current device.
BRPA is suitable for the selection task of a simple scene with a single target. It is proposed based on the target endpoint distribution model without interference. When the coordinates of the pointing endpoint are in the region of the ST-PEDM, the decision interacts with the pointing target.
ERPA is suitable for the directional selection task in complex multi-target scenes, and is proposed based on the multi-interference target endpoint distribution model. First, the distance between the target and the interference target is calculated. When the distance is greater than 50 cm, as long as the coordinate of the pointing endpoint is in the region of the MT-PEDM, it is determined to interact with the pointing target. When the distance is less than 50 cm, the Voronoi diagram is used to divide the overlapping region again. The expected µx and µy on the
x and
y axes of the two overlapping regions are taken as vertices, and vertical bisecting lines are constructed to divide the overlapping region and form a continuous polygon region. It is implemented through Algorithm 1. In complex environments with multiple targets and interfering points, Voronoi diagrams provide a mathematically precise method for dividing regions, ensuring that each target point has its own dedicated region. As long as the coordinates pointing to the endpoint are within the final division area, the decision interacts with the pointing target, as shown in
Figure 2b.
Algorithm 1 Dividing regions and computing Voronoi diagram |
- 1:
Input: - 2:
Target points , - 3:
Interfering target points , - 4:
Threshold distance = 50 cm - 5:
Output: - 6:
Divided regions and Voronoi diagram for overlapping regions - 7:
Procedure CALCULATE_DISTANCE(p1, p2) - 8:
Return the distance between points and - 9:
End Procedure - 10:
Procedure DIVIDE_REGIONS() - 11:
Initialize - 12:
Initialize - 13:
for each target point in P do - 14:
Initialize region for - 15:
for each interfering target in I do - 16:
- 17:
if then - 18:
Add to ’s region - 19:
end if - 20:
end for - 21:
Add ’s region to - 22:
end for - 23:
for each pair of overlapping regions in do - 24:
Add bisector based on - 25:
end for - 26:
Return divided regions and Voronoi diagram for overlapping regions - 27:
End Procedure
|
3. Experiment Task and Measurements
3.1. Participants
In this study, 15 participants were recruited, consisting of 11 males and four females. Two participants were left-handed, while thirteen were right-handed. The age range of the participants was from 23 to 28 years, with a median age of 24 years and an average age of 24.6 years. The participants had a minimum height of 164 cm, a maximum height of 191 cm, and an average height of 173.6 cm. All participants were in good health without disabilities and had experience in directional interactive operations. The experiment for each participant lasted approximately 2 h and collected a total of 31,050 experimental data points.
3.2. Apparatus
The experimental equipment used in this study mainly included one computer, one projector, and one set of motion capture equipment (including 12 high-speed infrared cameras, operating at 100 Hz frequency, with sub-millimeter accuracy). The computer was primarily used for calculating endpoint coordinates, creating the pointing feedback interface, monitoring the experiment’s runtime status in the background, developing the experiment program, and analyzing data post-experiment. The motion capture system served as the experimental environment, automatically recording experimental data such as time and the three-dimensional coordinates of various markers. The projector provided visual information feedback for participants during the experiment.
3.3. Experimental Design and Procedures
We designed a pointing device equipped with markers, along with several planar circles marked with reference points as selectable targets, as shown in
Figure 3. First, the targets were placed within the range of the motion capture system, and their actual coordinates were determined based on the captured markers. Next, the experimenter held the pointing device and performed selection tasks at varying distances from the targets. Using the marker coordinates captured from the pointing device, an arrowed line was calculated to represent the virtual pointing trajectory. When the virtual ray intersected with a target, the target was considered successfully selected.
3.3.1. ST-PEDM Experiment
The experiment was divided into three groups: a no-Heisenberg effect experiment, a Heisenberg effect experiment, and a control group experiment. The experiment’s independent variable was the distance between participants and the target (60 cm, 120 cm, 180 cm, 240 cm, and 300 cm). In the no-Heisenberg effect experiment, participants used their dominant hand to hold the indicator for pointing actions, followed by the other hand to grasp the mouse and click the right button for confirmation actions after pointing to the target. After confirmation, the indicator box changed from pink to light green, and each target distance was repeated 50 times.
Heisenberg effect experiment: Participants used their dominant hand to hold both the indicator and the mouse for pointing actions. After pointing to the target, they clicked the right mouse button to confirm the action. Upon confirmation, the indicator box changed from pink to light blue. This process was repeated 50 times for each target distance.
Control group experiment: To mitigate the potential influence of subtle differences in the experimental environment on the distribution of endpoints, which could impact data comparisons with the first two experimental groups, the control group experiment maintained the same experimental environment. Participants alternated between holding the mouse with their dominant and non-dominant hands to perform clicking confirmations. After confirmation, the indicator box changed from pink to light blue for the Heisenberg effect condition and from pink to green for the no Heisenberg effect condition. This process was repeated 20 times for each target distance.
The overall design of the experiment yielded a total of 15 participants × 5 target distances × 50 trials for the pointing task × 2 groups (Heisenberg and no Heisenberg) + 15 participants × 5 target distances × 20 trials for the pointing task × 1 group (control) = 9000 experimental trials.
3.3.2. MT-PEDM Experiment
The experiment was divided into three groups: Horizontal Placement of Interfering Objects Experiment, Vertical Placement of Interfering Objects Experiment, and Diagonal Placement of Interfering Objects (Control Group) Experiment, as shown in
Figure 4. The distance between participants and the target was set at 180 cm. The independent variables of the experiment were the direction of interfering objects and the distance between the interfering objects and the target point (10 cm, 20 cm, 30 cm, 50 cm, and 75 cm).
Horizontal Placement of Interfering Objects Experiment: To investigate the impact of the orientation of target placement on the distribution of endpoint positions along the x-axis, two target points were placed horizontally. Participants alternately selected between the two target points and repeated the experiment 50 times. The distance of interfering points was adjusted for the next set of experiments.
Vertical Placement of Interfering Objects Experiment: To examine the influence of the orientation of target placement on the distribution of endpoint positions along the y-axis, two target points were placed vertically. Participants chose between the two target points alternately, and the experiment was repeated 50 times. The distance of interfering points was adjusted for the next set of experiments.
Diagonal Placement of Interfering Objects (Control Group) Experiment: To assess the reliability of the effects observed in the first two groups in a real-world scenario, considering the combined impact of horizontally and vertically placed interfering objects on endpoint distribution, interfering points were placed at a 45° angle relative to the target points. After participants completed 50 pointing tasks, the distance of interfering points was adjusted for the next set of experiments.
When participants engaged in the pointing task, the experimental interface indicated the target point that needed to be selected. Upon clicking the confirmation action, the target point indicator box in the experimental interface changed from pink to light green, signaling to the participants that they had completed the task, and data were recorded, as shown in
Figure 4.
Before the experiment began, participants were provided with a detailed explanation of the experimental procedures and requirements. Additionally, a single training session was conducted. The overall design of the experiment resulted in 15 participants × 5 distances between interfering objects and target points × 50 trials for the pointing task × 3 groups = 11,250 experimental trials.
3.3.3. Region Partitioning Algorithm Experiment
The design of the experimental task was similar to that of the ISO9241-9 [
11] multi-directional tapping task; 8 target points were set, each target point was set to a fixed size, and the target point was uniformly placed on a circle. The participants completed the corresponding orientation selection task at three different target distances of 100 cm, 200 cm, and 300 cm.
When participants performed the directional selection task, the experimental interface indicated the target points to be selected; the target to be selected was light red, and the remaining seven target points were light yellow, as shown in
Figure 2a.
Target selection completion time test experiment: When pointing to the endpoint and entering the target division area, the target point indicator box in the experiment interface changed from light red to light green, indicating that the participant had correctly selected the target and could click to confirm. Participants clicked the left mouse button, the timing ended, and the background recorded the selection time.
Pointing task accuracy test experiment: When participants thought they had pointed to the target, they clicked the left mouse button to confirm. Based on whether the pointing endpoint was in the target area, the background determined whether the pointing task was correct or wrong and recorded it.
Before the start of the experiment, the participants were informed about the experimental steps and requirements in detail, and 2 groups of training experiments were carried out. The design of the experiment yielded a total of 15 participants × 3 target distances × 8 targets × 5 times × 3 situations = 10,800 experiments.
5. Discussion and Conclusions
By investigating the impact of factors such as target distance, Heisenberg effect, visual error, hand tremor, and interfering target distance on endpoint distribution, we proposed ST-PEDM and MT-PEDM, and, based on these two models, we proposed BRPA and ERPA.
In the study of the directional endpoint distribution model, EDmodel has received positive feedback for pointing tasks in both AR and VR scenarios [
7]. We will investigate whether EDmodel fits pointing tasks in real-world scenarios. EDmodel proposes the interaction effect of target distance and target width on the covariance matrix. We fitted our experimental data into the ED model, and the fitting results are presented in
Table 7. The results indicate significant improvement in fitting our proposed ST-PEDM, making it more suitable for cursor-free pointing selection tasks in real-world scenarios.
Following a bivariate normal distribution, the endpoint distribution of single-target pointing tasks is determined by target distance and the Heisenberg effect. The distributions along the x-axis and y-axis are relatively independent. This model accurately describes the distribution area of endpoints in single-interaction target scenarios.
The endpoint distribution of multi-target pointing tasks is determined by target distance, the Heisenberg effect, distance to interfering targets, and their orientation. The offset in the mean of the endpoint distribution model is strongly correlated with the distance to interfering targets. This model provides a relatively accurate description of the distribution area of endpoints in complex scenarios with multiple targets.
Through data analysis of completion times for pointing selection tasks, it is evident that BRPA and ERPA significantly improve completion times for real-space cursor-free pointing selection tasks. Moreover, the advantages of these algorithms are more pronounced when the target distance is greater.
Although the model proposed in this paper achieved good fitting in experimental validation, there are still some limitations. The experimental model was conducted using the high-precision motion capture platform in a laboratory setting. In real-world scenarios, sensor accuracy is often limited by device size and portability constraints, making it difficult to match the precision of laboratory platforms. Indoor positioning and pointing angle determination typically rely on ultra-wideband technology, which inherently has sub-meter level distance errors and 15-degree angular errors, differing from the precision of the endpoint distribution model’s experimental platform.
For future work, we aim to validate the applicability of the proposed model and region partitioning algorithms by assembling a pointing selection platform that utilizes ultra-wideband technology. Additionally, we plan to apply the proposed model and algorithms to the rapid capture control and trajectory planning of drone swarms. This approach will enable precise and rapid targeting and control of target drones using pointing interaction technology, combined with smart glasses for command issuance and data transmission, thereby achieving drone control and trajectory planning.