2.1. Research Status of On-Orbit Processing of Space Remote Sensing
Since the 1990s, researchers have progressively conducted investigations on the pivotal technologies of space-based intelligent remote sensing satellites. In recent years, global space powers have placed significant emphasis on the real-time onboard data processing capability in orbit and its application in national defense infrastructure and space security. Due to the vast amount of remote sensing satellite image data, the scarcity of domestic satellite ground receiving stations, and the limitations in satellite-ground data communication bandwidth, there are practical application demands for efficient down transmission, rapid information extraction, and emergency support concerning massive remote sensing data. Therefore, the intelligence of space-based remote sensing satellites constitutes the mainstream in the development of advanced remote sensing satellite systems. The concept of space-based intelligent remote sensing satellites emerged from on-orbit real-time processing technology. With advancements in intelligent chip and artificial intelligence, the research and application of intelligent remote sensing satellites are encountering novel challenges and opportunities [
6].
The United States has been developing satellite real-time processing technology in orbit according to the established plan. As early as around 2000, they initiated the development of intelligent remote sensing satellites. In terms of hyperspectral remote sensing data, the US Air Force has equipped the onboard autonomous processor on TacSat-3 tactical satellite, enabling independent planning of acquisition mode for the onboard hyperspectral imager as well as real-time processing and storage of image data [
7]. The United States Naval Research Institute has deployed an image parallel processing array on the Naval EarthMap Observer (NEMO) satellite, which is based on multiple digital signal processing units [
8]. This advanced system enables real-time feature extraction and data compression of hyperspectral data onboard, with the processed results directly transmitted to the ground for immediate application. The National Aeronautics and Space Administration (NASA) launched the EO-1 hyperspectral satellite with the objective of enabling applications such as extraction of hyperspectral image information and detection of changes in orbit, thereby optimizing data acquisition time for downstream transmission to various applications [
9]. In terms of infrared remote sensing images, the space-based early warning satellites in the US National Defense Support Program achieve image redundancy background data removal, target detection and tracking, and fast data down transmission through on-orbit processing. The extended Space Based Infrared System (SBIRS) is employed for direct recognition and extraction of multiple targets at the missile terminal [
1]. In terms of video data, the United States Skybox company has applied H.264 and other general video coding technologies to satellite video compression, enabling direct application of video data for target recognition and tracking [
8]. The US Department of Defense Advanced Research Projects Agency and SpaceX have launched the "Black Jack" low-orbit satellite constellation project, leveraging the cost-effective benefits of commercial satellite platforms and payloads, integrating cutting-edge technologies such as artificial intelligence and on-orbit large-scale distributed computing to achieve autonomous real-time processing of satellite data in orbit [
9]. This initiative caters to global military intelligence reconnaissance, combat command and control, and other application scenarios.
European space agencies have also made significant advancements in the development of intelligent remote sensing satellites, as well as conducted extensive research on on-orbit processing technology for various types of remote sensing image data, resulting in notable achievements [
1,
3]. Around 2000, the European Space Agency initiated the PROBA project to conduct research on on-orbit processing of remote sensing data. Subsequently, in 2009, they launched the PROBA-2 satellite with enhanced capabilities for on-board spectral channel programming, merging, and on-orbit data processing of imaging spectral data. Since 2017, research on space-based image processing in orbit has been conducted using the Hyperscout-2 satellite-borne imager. The Phisat-1 satellite is equipped with an embedded intelligent processing unit within a neural network, exhibiting fundamental characteristics of an intelligent remote sensing satellite [
10]. In October 2001, the German Space Center developed the BIRD small satellite, which was equipped with on-board processing capabilities to accomplish tasks such as preprocessing visible light and infrared data and conducting on-orbit application analysis [
11]. Since 2011, the French Space Center has successfully launched the Pleiades series of high-resolution optical satellites, namely Pleiades-1A and Pleiades-1B satellites, to accomplish on-orbit real-time acquisition, correction, compression, and other onboard preprocessing tasks for optical remote sensing image data [
12].
Numerous research institutes in China have conducted extensive investigations on space-based intelligent remote sensing satellite technologies at various levels. The "Tianzhi-1" intelligent optical remote sensing satellite, launched by Institute of Software Chinese Academy of Sciences (ISCAS) in 2018, represents China's pioneering software-defined satellite operating in orbit [
13]. It encompasses intelligent deployment tasks, advanced measurement and operation control capabilities, as well as sophisticated information processing functionalities. Leveraging cloud computing platforms, it intelligently deploys on-orbit data processing tasks and achieves rapid pre-processing for ground-based analysis. The first intelligent remote sensing scientific test satellite, Luojia3 was jointly developed by Wuhan University and Aerospace Dongfang Hong Satellite Co., Ltd. It is equipped with a software-defined multi-mode optical imaging payload. Additionally, an intelligent on-orbit real-time processing technology system of "mission planning → sensor calibration → target detection → intelligent compression" has been proposed [
1]. The Wuhan University and Yantai Municipal Government of Shandong Province have jointly proposed the establishment of the Oriental Smart Eye (OSE) intelligent remote sensing satellite constellation program. This program incorporates an autonomous on-orbit intelligent processing terminal, integrates Beidou short message and inter-satellite real-time transmission terminals, and employs cooperative networking, intelligent scheduling, joint observation, and interconnection of intelligent terminals within the satellite constellation to achieve real-time intelligent service for satellite remote sensing information. The ultra-low orbit satellite Integrated constellation developed by China Aerospace Science and Industry Corporation (CASIC) establishes a real-time remote sensing service application demonstration system through on-board intelligent processing, end-to-end satellite connectivity, and inter-satellite communication. This enables intelligent computation of on-orbit remote sensing satellites, precise "sensing" capabilities at the nanometer scale, and real-time "transmission" at the minute level [
14].
With the advancement of commercial space development, the "Jilin-1" satellite series developed by Changguang Satellite Technology Co., Ltd. incorporates on-orbit real-time intelligent processing systems into its spectral 01 and 02 magnitude platforms, facilitating on-orbit image compression, algorithm injection deployment, and on-orbit update functionalities. Notably, the Jilin-1 platform's satellites 02A01 and 02A02 have achieved a groundbreaking milestone in China by enabling ultra-high-speed high-resolution remote sensing image transmission through inter-satellite laser technology at a rate of 100Gbps, successfully transmitting inter-satellite data to ground stations [
15]. The "Chaohu-1" series synthetic aperture radar (SAR) remote sensing satellite, developed by Changsha Tianyi Space Science and Technology Research Institute Co., Ltd., is equipped with a high-performance on-board processing platform to achieve real-time target identification and segmentation of SAR images in orbit, enabling transmission of the report through the Beidou System. The "Taijing III-02 satellite developed by Beijing Minospace Technology Co., Ltd. is equipped with a high-performance intelligent computing platform, enabling efficient optical image compression, target recognition, and positioning services for users [
16]. In the realm of commercial space remote sensing satellite constellations, such as the "PIESAT constellation," "Tianfu constellation," and "Zhuhai-1," various levels of experiments on space-based on-orbit data processing have been conducted. Targeting the research, production, and service mode of domestic civilian and commercial remote sensing satellite systems, this study aims to overcome key technologies such as space-based high-performance computing architecture, intelligent mission scheduling and planning, and on-orbit real-time intelligent service. It strives to establish a space-based information real-time intelligent service system with a space-based cloud brain system, seamless integration of space-time data, and intelligent decision-making capabilities. The objective is to provide users with real-time access to space-based remote sensing observations in order to enhance the efficiency, timeliness, and accuracy of remote sensing satellite applications [
17].
2.2. Development Status of Space On-Orbit Processing Platform
The application mode of space remote sensing satellites involves the collection of image data in orbit, transmission of data to the location processing center through satellite-ground communication links, and realization of various data processing, sharing, and distribution for different professional fields or application departments at all levels on the ground. However, there are increasing challenges related to contradictions between satellite remote sensing data and satellite-ground data transmission links as well as a shortage of domestic ground receiving stations. The traditional application mode "space-sensing ground-computing" was unable to meet the demands for high time-sensitive and emergency tasks. On-orbit real-time calculation of satellite remote sensing images presents an effective solution; however, insufficient space-based on-orbit computing power remains a major obstacle. Typically, the computing power of traditional remote sensing satellite chassis is less than 10Tops with a weight around 10kg primarily consisting of CPUs which fails to meet current rapid development requirements for space missions. Considering that 1kg weight corresponds to a launch cost of approximately 150,000 yuan (CNY), it becomes crucial for onboard processing machines as part of the satellite in orbit system to fulfill performance requirements while minimizing volume and weight [
18].
The processors of space-based on-orbit intelligent processing platform primarily consist of CPU, GPU, FPGA, neural processing unit (NPU), digital signal processing (DSP), and other types [
19]. Considering the characteristics of on-board computing tasks, the CPU serves as the computing and control core of a computer and is mainly utilized for instruction scheduling and control. It is suitable for scalar computation but not for accelerated computation of deep learning models. The GPU exhibits high parallelism, high memory bandwidth, and fast execution speed. By employing CPU control calls, the computational performance of the GPU can be significantly enhanced. The multi-core clustering mode satisfies the requirements for highly parallel computing in large-scale data processing and complex computational tasks. The NPU emulates biological neural network systems by simulating biological neurons and synapses at the circuit level while customizing designs to accommodate deep learning networks' characteristics thereby achieving processor storage integration with computation capabilities. The FPGA operates closer to underlying IOs with abundant logic units enabling efficient parallel acceleration of deep learning processing models; however, it faces challenges in implementing complex algorithms due to its high-cost implications. As a specialized microprocessor, the DSP is predominantly employed for matrix multiplication and addition operations necessitating combinations with multiple DSPs alongside other processors [
20].
The United States Space Development Agency (SDA) has proposed the National Defense Space Architecture (NDSA) for strategic space development deployment, based on military operational requirements such as space-based high-performance computing [
22]. This architecture is structured into seven layers: transmission, combat management, tracking, supervision, emerging capability (deterrence), navigation, and support. Each NDSA satellite in the combat management layer is equipped with edge computing payload primarily responsible for on-orbit data processing. It possesses intelligent distributed management capabilities along with space-based task processing and communication transmission abilities. Amazon Web Services leverages onboard computing to eliminate data transmission delays and bandwidth limitations by deploying a suite of edge computing products on low-orbit satellites. By utilizing cloud technology directly on the satellite, it enables automatic collection and analysis of vast amounts of raw satellite data while transmitting valuable results through satellite-based communications for storage or further analysis.
NASA's Goddard Space Flight Center has pioneered a novel hybrid space-based on-orbit data processing scheme that integrates commercial devices with radiation hardening methods, proposing an advanced space-based computing architecture capable of optimizing the performance of CPUs, DSPs, and FPGAs [
20]. Microchip has collaborated with NASA's Jet Propulsion Laboratory to design and manufacture high-performance space computing (HPSC) processors, aiming to develop a space-resistant system on chip (SoC) that can replace the existing spacecraft's space-grade chips while maintaining identical power consumption, size, and weight. Consequently, the computational capabilities in space have been enhanced by over 100 times [
23]. In August 2023, Sidus Space, a US-based satellite manufacturing company, acquired Exo-Space, a leading company specializing in space edge computing. This acquisition aims to integrate advanced artificial intelligence software technology into Earth observation satellites. Exo-Space combines anti-radiation hardening hardware with flexible and customizable elastic software. By leveraging the software platform, it enables on-board transmission optimization, updates and modifications of payloads. Furthermore, this integration facilitates collaboration among satellite constellations, computing chips, and software platforms to enhance the comprehensive utilization value of satellite hardware while unlocking their application potential across diverse fields [
24]. The on-board processing load of the BIRD satellite consists of FPGA, DSP, and other processors. The on-board high-resolution imaging spectrum processing system of the PROBA-2 satellite is a dedicated DSP system. Additionally, the on-board processor of the Pleiades series satellites utilizes FPGA technology to accomplish image processing tasks in orbit [
21].
The domestic space-based on-orbit computing capacity has reached parity with that of the United States, and it possesses a robust domestic independent and controllable research and development capability, which has been validated through the successful operation of numerous civil and commercial satellites or constellations. Zhuhai Hangyu Micro Technology Co., Ltd. is engaged in developing a new generation of aerospace SOC series chips as well as various space-based computers for satellite, spacecraft, aircraft, and other applications. These advancements encompass mainstream architecture systems such as scalable processor architecture (SPARC) and reduced instruction set computer (RISC), including navigation and communication chips, master control chips, artificial intelligence chips, and general computing chips [
24]. Notably among these developments is the independently developed Yulong 810A chip featuring a primary processor utilizing a 4-core advanced RISC machines (ARM) A9 along with a coprocessor comprising GPU and the neural network accelerator (NNA) units; this chip boasts an impressive floating point computing capacity of 64GFlops alongside a fixed-point computing capacity of 12Tops. The BM3883MARH, a third-generation domestic 8-core processor developed by the Beijing Institute of Microelectronics Technology, adopts the anti-radiation SPARC architecture and achieves a floating-point operation performance of 32GFlops and a fixed-point operation performance of 16GOPS [
25]. For space-based on-orbit processing, Xi'an Jiaotong University has designed a heterogeneous computing unit consisting of class 4 FPGA and class 6 DSP reconfigurable components. Additionally, Xi'an Institute of Space Radio Technology has developed an on-board processing platform utilizing large-scale FPGA and multi-core DSP to support intelligent processing and analysis of remote sensing images in orbit. Shandong Aerospace Electronics Technology Institute has also developed an on-board image real-time processor with a hybrid architecture combining FPGA+CPU+NPU+DSP/GPU based on the domestic commercial Cambrian Chuangzhi-2. Customized anti-irradiation reinforcement and system heat dissipation design have been implemented according to the requirements for space radiation environment applications [
20]. StarDetect Technology (Beijing) Co., Ltd., for the first time, has adopted a super-heterogeneous computing system comprising intelligent reconfigurable CPU+FPGA+GPU+AI, along with conducting multi-dimensional system-level COST chip anti-irradiation reinforcement design to achieve high computing power up to 275Tops per kilogram intelligent load and high energy efficiency ratio of 4Tops/Watt. This technology leads two generations ahead compared to foreign FPGA+GPU architectures with an on-orbit time exceeding international peers by more than one year.
In the field of artificial intelligence chips, CPUs or ARM cores are initially employed for scheduling processing tasks, followed by achieving large-scale parallel computing through GPUs, FPGAs, or ASICs with diverse architectures such as Google's TPU, Horizon's BPU, Cambrian and Huawei's NPU. On-board intelligent computers face a dilemma in meeting the increasingly high-performance requirements for space-based cloud computing and massive remote sensing image processing tasks due to the inability to use traditional anti-irradiation chips while being hesitant to adopt high-performance commercial devices [
26]. The on-board intelligent processing platform is capable of effectively meeting the requirements for intelligent processing of remote sensing images, with special reinforcement processing necessary for satellite-borne environments. Depending on the specific demands of space applications such as computing power and localization, different architectures are typically adopted. For high-orbit applications, FPGA+DSP architecture is commonly employed to ensure reliability; whereas low-orbit satellite constellations tend to favor CPU+GPU/NPU architecture in order to meet the demand for extensive computing power. In terms of device selection, notable domestic high-performance processors include Feiteng CPU, Fudan University Microelectronics 7 series FPGA, National University of Defense Technology FT-M6678 DSP, Huawei Atlas 200 intelligent processing module etc., while foreign NVIDIA companies offer high-performance GPUs such as TX2, AGX and Orin [
27].
2.3. Research Status of Intelligent Processing Algorithms on Satellites
In the interpretation and analysis of remote sensing images, a wide range of neural network models have been utilized, encompassing shallow architectures such as early back propagation (BP) neural networks and support vector machines (SVM), as well as deep learning frameworks like convolutional neural networks (CNN) and the you only look once (YOLO) series, these techniques have found extensive applications in remote sensing domains including target detection and recognition, semantic segmentation and classification, as well as change monitoring. Moreover, it has expanded into research areas such as multi-modal fusion of remote sensing data, image description, and knowledge deduction based on artificial intelligence [
28]. In addition to academic research efforts, commercial companies have also conducted investigations into large models for remote sensing applications. The intelligent processing and analysis of remote sensing image based on deep learning realizes end-to-end detection through sample set construction, model training and prediction. However, compared with natural scenes, remote sensing image samples are relatively insufficient, models are fragmented, data sources are diversified, etc. Therefore, it is imperative to explore intelligent processing methods for remote sensing images such as small sample learning, incremental learning and transfer learning [
29]. Additionally, considering the limited computing resources of satellites, the effectiveness and efficiency of applying model algorithms on the ground may not meet the application requirements of the actual scene. In addition, when deploying the algorithm model on the satellite, it is also necessary to perform model lightweighting and algorithm acceleration processes such as pruning, quantization, and weight sharing.
The research on intelligent real-time processing algorithm of space-based remote sensing data primarily focuses on mission planning, data acquisition, processing and analysis, storage and distribution, as well as transmission of the acquired data [
30]. Due to the vast amount of remote sensing image data, the conventional process-oriented model for producing remote sensing products is no longer suitable for on-board processing, necessitating a shift towards task-driven or event-based sensing. In terms of on-orbit processing algorithms for remote sensing images, extensive research and verification work has been conducted by both domestic and international research institutes as well as commercial companies, such as on-orbit real-time task planning, image analysis, calibration and correction, intelligent interpretation and data compression [
31]. As an illustrative example, the United States Space Development Administration and commercial space enterprise SpaceX have integrated artificial intelligence, machine learning, and other advanced technologies to automate the processing, in-depth analysis, and practical utilization of space remote sensing images. The National Aeronautics and Space Administration (NASA), the Defense Advanced Research Projects Agency (DARPA), the German Space Center, and other institutions employ on-board mission scheduling planning and data analysis to facilitate real-time utilization of on-orbit data in military target identification and fire detection scenarios. This serves to support time-critical task requirements such as military target reconnaissance, combat command and dispatch, as well as geospatial intelligence acquisition. Chinese scientific research institutions such as Wuhan University and the Chinese Academy of Sciences have conducted extensive research on on-orbit processing algorithms for satellites. They possess the capability to effectively utilize high-resolution optical, infrared, and SAR remote sensing images for a range of on-orbit processing tasks, encompassing image compression, aircraft and ship target detection, cloud and fog identification, as well as supporting algorithm model updates during satellite operation [
32].
2.3.1. On-Orbit Satellites Task Scheduling
With the improvement of satellite payload capacity, space business has gradually transformed into non-single exploration missions. China has developed a real-time space information service system that integrates satellite positioning, navigation, timing, remote sensing, and communication (PNTRC) to achieve the goal of providing intelligent space information services in real-time through single-satellite multi-purpose functionality, multi-satellite collaboration, and satellite-ground inter-connection [
33]. The common technical key point of different mission satellites is to give satellites the ability of autonomous decision-making, communication coordination, scheduling and other on-board mission planning according to dynamic observation requirements, operation status and mission content [
34].
The satellites task scheduling planning can be categorized into two levels: single-satellite autonomous task scheduling planning and multi-satellite cooperative task scheduling planning. The evolution of single-satellite autonomous task scheduling has progressed from ground-based off-line scheduling to on-board scheduling and integrated satellite-ground scheduling. However, multi-satellite cooperative mission scheduling has developed from multi-satellite ground scheduling to multi-satellite on-board scheduling to multi-satellite cooperative scheduling. The primary methods of single-satellite scheduling strategy encompass the Autonomy Generic Architecture - Test and Application (AGATA) utilized by the Pleiades series satellites of the French Space Agency, NASA's Remote agent (RA), and the Continuous Activity Scheduling Planning Execution and Replanning System (CASPER) [
35]. The small-scale multi-satellite mission scheduling in terms of multi-satellite cooperative scheduling typically employs traditional algorithms such as greedy and backtracking [
36]. On the other hand, the more complex multi-satellite online system mission scheduling adopts a centralized architecture where one spacecraft serves as the primary satellite within the satellite formation [
37]. This approach enables mission planning and scheduling based on resource balancing among the primary satellite of the satellite cluster. Subsequently, in order to avoid the disadvantages caused by the failure of the central satellite, scholars proposed that the decentralized distributed task scheduling method has better environmental adaptability [
38].
2.3.2. On-Orbit Data Compression
As one of the key technologies in on-orbit information processing, on-orbit data compression plays a crucial role in alleviating storage constraints for remote sensing satellites. Employing high compression ratio algorithms enables the transmission of more valuable information within limited communication bandwidth [
39]. The differential pulse code modulation (DPCM) compression algorithm was initially implemented in optical satellites such as SPOT, QuickBird, WorldView, and GeoEye series [
40]. These satellites utilize two primary algorithms for data compression on radar imaging satellites: block adaptive quantization (BAQ) and block floating point quantization (BFPQ) [
41]. The "Ziyuan" series satellites have achieved on-board data compression using pulse code modulation (PCM) and DPCM encoders for the first time in China. The discrete wavelet transform (DWT) method is widely used in on-orbit satellite image data compression, such as Pleiades-1, ICESat-2, Gaojing-1, Taijing-3, etc. However, the on-orbit compression ratio achieved by the DWT method is typically less than 10 times [
22]. In recent years, the advancement of artificial intelligence technology has led to the emergence of more sophisticated on-board data compression methods. Among these methods, task-oriented intelligent compression techniques have demonstrated superior capabilities in achieving higher rates of data compression for on-board remote sensing images. The information extraction methods such as target detection, change monitoring and image segmentation are used to extract the region of interest (ROI) for Luojia-3, and then selects a compression model suitable for ROI, the bit rate is allocated adaptively to improve the compression ratio and data transmission efficiency of on-orbit images [
1].
2.3.3. On-Orbit Data Intelligent Interpretation
The primary objective of intelligent interpretation of satellite remote sensing im-ages is to accurately locate, classify, and identify changes in the target of interest. In military intelligence reconnaissance, target monitoring, and disaster emergency rescue operations, computational efficiency plays a crucial role in ensuring effective implementation. The algorithms for object detection in remote sensing images based on deep learning can be categorized into two groups: region-based methods using candidate regions and regression analysis. The target detection method based on the candidate region is divided into two steps. Firstly, a series of candidate regions that potentially contain targets are generated. Secondly, the target and background boundary box of each candidate region are classified through regression. Representative algorithms for this approach include region convolutional neural network (R-CNN) and a series of enhanced algorithms based on Faster R-CNN, which aim to improve the representation of target features, optimize the generation and processing of regions of interest, and enhance the accuracy of target positioning [
42]. The remote sensing target detection based on regression analysis has two types of algorithms based on YOLO and single shot multi-box detector (SSD) frameworks [
43]. The main difference is that there is no need to generate a separate candidate region. Instead, the bounding boxes and categories of the target are directly regression analyzed from multiple positions of the input image. The YOLO series algorithm has the advantage of easy lightweight deployment, and the on-orbit verification is the most in real time processing on board. The difficulty of on-orbit target detection lies in balancing the contradiction between detection accuracy and on-orbit computing resources. The YOLO and other algorithms using downsampling will lead to information loss, while using sliding window detection will affect the detection efficiency, and will face the existence of multi-target, small target and multi-scale target detection problems. However, the introduction of attention mechanism based on Transformer model or combined with improved algorithms such as YOLO can achieve effective detection of multi-scale targets [
42]. In view of the characteristics of high resolution and high real-time remote sensing on board, it is necessary to study and design a lightweight and efficient network model suitable for remote sensing data processing on board [
44].
In view of the application requirements of on-orbit processing in remote sensing images, it has important research significance in the algorithms of small unsupervised learning target detection based on depth science, dynamic target detection of satellite video data, multi-source data fusion target detection and so on [
45]. In addition to the application of object detection, there are also application directions such as image scene classification, semantic segmentation and change detection. When the above algorithms are implemented in on-orbit real-time processing on board, scene segmentation classification is a data processing based on a single data source, which has something in common with the object detection algorithm in terms of algorithms [
46]. The main difference between them lies in the difference of network model construction, sample set production and output results [
47]. However, the remote sensing image change monitoring algorithm is the comparative analysis of multi-period data [
48]. Usually, based on the change monitoring of multi-period remote sensing images, the image is classified first, and then the difference of the classification is compared, so as to realize the change detection and monitoring of remote sensing images, which puts forward higher requirements for the change detection of remote sensing images on board [
49].
With the development of natural language processing (NLP) technology, the application of large models in all walks of life has gradually emerged, as shown in
Table 1. Remote sensing large model is also called remote sensing pre-training basic model. A large number of unlabeled remote sensing images are used to train large model to extract general feature representation, so as to improve performance, efficiency and versatility. Three key factors are involved: the pre-training dataset, the number of model parameters and the pre-training technique. However, the drawbacks of large model training cost and large number of parameters are not conducive to practical application, such as generative pre-trained transformer 4 (GPT 4) training on 25,000 A100 Gpus, with 1.8 trillion parameters, inference cost of 525 billion Davinchi, its cost is about
$60 million. To develop large-scale visual language models for multi-modal data analysis in the field of Remote Sensing, such as the remote sensing foundation model (RingMo) first proposed by the Institute of Aerospace Information Innovation of the Chinese Academy of Sciences for generative pre-training of cross-modal remote sensing data [
50]. The model can automatically extract common features of remote sensing ground objects, and has strong generalization ability for new tasks, and supports multi-modal multitasking. Subsequently, the SenseEarth large model of SenseTime Technology and the AI Earth segmentation foundation model (AIE-SEG) of Alidama Institute appeared successively. The contrastive language-image pre-training (CLIP), remote sensing GPT (RSGPT) and other multi-modal remote sensing intelligent interpretation models built on video, image and text data have been successfully applied to remote sensing image target recognition, object classification, image description and other aspects, gradually deepening the intelligent application of remote sensing data.
In summary, when various large remote sensing models are deployed and applied on the ground, many problems such as the simplification, fragmentation and cross-modal application of deep learning models can be solved. However, the deployment and use of large models requires sufficient computing power resources and sample resources to ensure the training application of large models. If the large model of remote sensing is deployed to the satellite for on-orbit application, it needs the support of large-scale computing power and the lightweight of the algorithm model. Therefore, building a space computing network with large-scale computing power is the primary prerequisite for realizing intelligent computing on large model satellites.