Abstract: | A plurality of remote sensing images of a scene are received. A potential target object can be identified in one of the images, wherein the target object has a low signal-to-noise ratio (SNR). A candidate motion path of the target object can be generated based upon the images. A predicted position of the target object along the candidate motion path is determined for each of the remote sensing images. An image chip is extracted from each of the images, where each image chip is centered about the predicted position of the target object in its corresponding image. A sum image chip is generated based upon the image chips. An indication that the potential target object is an actual object in the images is output based upon a value of a center pixel of the sum image chip. |