In this section, extensive experiments are presented to assess the suggested Instance-aware Visual Language Map (IVLMap) in both simulated and real-world settings. Leading baselines like VLMap, CoW, and CLIP Map were compared against IVLMap using the Habitat simulator and the Matterport3D dataset. Results show that IVLMap performs better in instance-aware and multi-object navigation tasks, allowing for zero-shot goal navigation and accurate localization.

IVLMap Solves Robot Navigation By Mapping Individual Objects

2025/11/10 19:41

Abstract and I. Introduction

II. Related Work

III. Method

IV. Experiment

V. Conclusion, Acknowledgements, and References

VI. Appendix


IV. EXPERIMENT

A. Experimental Setup

We employ the Habitat simulator [35] alongside the Matterport3D dataset [2] to assess performance in multi-object and spatial goal navigation tasks. Matterport3D is a comprehensive RGB-D dataset featuring 10,800 panoramic views built from 194,400 RGB-D images across 90 building-scale scenes, designed to advance research in indoor scene understanding. For map creation in Habitat, we capture 13,506 RGB-D frames spanning six distinct scenes and record the camera pose for each frame, using the cmu-exploration environment we set up (Sec. IV-B). Due to computational constraints, our navigation experiments and the Large Language Model (Llama 2) run on separate servers. Llama 2 runs on a server equipped with two NVIDIA RTX 3090 GPUs, while the IVLMap experiments run on a machine with an NVIDIA GeForce RTX 2080 Ti GPU with 12GB VRAM. The two servers communicate using the Socket.IO [4] protocol.

Baseline: We compare IVLMap against three baseline methods, all of which employ visual-language models and are capable of zero-shot language-based navigation:

  1. VLMap [5] fuses language and visual features into an autonomously constructed map and indexes landmarks from human instructions, supporting open-vocabulary mapping and natural language indexing for language-driven robot navigation.

  2. Clip on Wheels (CoW) [30] achieves language-driven object navigation by creating a target-specific saliency map using CLIP and GradCAM [36]. It applies a threshold to the saliency values, extracts a segmentation mask, and plans the navigation path based on this mask.

  3. The CLIP-features-based map (CLIP Map) serves as an ablative baseline. It projects CLIP visual features onto the environment’s feature map and generates object category masks by thresholding the feature similarity.
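The thresholding step described for the CLIP Map baseline can be illustrated with a minimal sketch. It assumes each map cell holds a feature vector and that a text embedding for the query category is available; the function names, the toy 2-dimensional features, and the threshold value are hypothetical and are not taken from the baseline's implementation.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def category_mask(feature_map, text_embedding, threshold=0.5):
    """Binary mask: 1 where a cell's feature is similar enough to the text embedding."""
    return [
        [1 if cosine_similarity(cell, text_embedding) >= threshold else 0 for cell in row]
        for row in feature_map
    ]


# Toy 2x2 map with 2-dimensional features (real CLIP features are far larger).
toy_map = [
    [[1.0, 0.0], [0.9, 0.1]],
    [[0.0, 1.0], [0.5, 0.5]],
]
sofa_embedding = [1.0, 0.0]  # hypothetical text embedding for "sofa"
mask = category_mask(toy_map, sofa_embedding, threshold=0.8)
print(mask)
```

The choice of threshold trades off missed objects against false positives, which is one reason a purely similarity-thresholded map serves as an ablation rather than a full method.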

Evaluation Metrics: Similar to previous methods [5], [37] in the VLN literature, we use the standard Success Rate (SR) metric to measure the fraction of navigation tasks completed successfully. We assess IVLMap’s effectiveness by (i) presenting multiple navigation targets and (ii) using natural language commands. Success is defined as the agent stopping within a predefined distance threshold of the ground-truth object.
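As a rough illustration of the Success Rate metric, the sketch below counts an episode as successful when the stop position lies closer to the goal than the threshold. The function names and the 2D coordinates are assumptions made for illustration; the 1 m default mirrors the threshold reported later in this section.

```python
import math


def is_success(stop_position, goal_position, threshold_m=1.0):
    """True if the agent stopped closer to the goal than the threshold (meters)."""
    return math.dist(stop_position, goal_position) < threshold_m


def success_rate(stop_positions, goal_positions, threshold_m=1.0):
    """Fraction of episodes in which the agent stopped near its goal."""
    if not stop_positions:
        return 0.0
    successes = sum(
        is_success(stop, goal, threshold_m)
        for stop, goal in zip(stop_positions, goal_positions)
    )
    return successes / len(stop_positions)


# Three hypothetical episodes: distances to goal are 0.5 m, 2.0 m, and 0.9 m.
stops = [(0.0, 0.0), (2.0, 2.0), (5.0, 5.0)]
goals = [(0.5, 0.0), (2.0, 4.0), (5.0, 5.9)]
sr = success_rate(stops, goals)
print(f"SR = {sr:.2f}")
```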

B. Dataset Acquisition and 3D Reconstruction

To construct a map for visual and language navigation tasks, it is crucial to acquire RGB images, depth information, and pose data from the robot or its agent while it is in motion. Common public datasets, such as Matterport3D [2], ScanNet [38], and KITTI [39], may not be directly applicable to our scenario. We therefore collected a dataset tailored to our specific requirements. Data collection was conducted in both virtual and real (Appendix C) environments to ensure comprehensive coverage.

Fig. 4. We created an Interactive Dataset Collection Scheme by combining the cmu-exploration development environment with the Habitat simulator. This involves integrating cmu-exploration’s autonomous exploration with Habitat robot agents for a unified dataset collection approach.

Interactive Data Collection in Virtual Habitats and CMU-Exploration Environment. CMU-Exploration [40], designed for autonomous navigation system development, offers various simulation environments and modules. Combined with the Habitat Simulator [35], it forms a platform where users develop and deploy navigation systems for real robots. Our interactive data collection system in this virtual environment uses CMU-Exploration’s Joystick and Visualization Tools for waypoint setting. Waypoints undergo local planning, terrain and radar analysis, and state estimation, generating control commands for ROS-based robot motion. Simultaneously, robot pose data is sent to the Habitat simulator, providing RGB, depth, and pose information for direct sensor control. See Fig. 8 for an overview.

Compared to other black-box data collection methods in the Habitat simulator, where predefined routes or exploration algorithms are used, this approach offers strong controllability. It allows tailored responses to the environment, enabling the collection of fewer data points while achieving superior reconstruction results. For the same scene in Matterport3D, our approach achieves results comparable to those of VLMap’s original authors while reducing the data volume by approximately 8%. In certain areas, the reconstruction quality even surpasses that of the original authors. For a comparison, see Fig. 5(a) and Fig. 5(b); for more detailed bird’s-eye views of our 3D reconstruction, see Appendix D.

Fig. 5. 3D Reconstruction Map in Bird’s-Eye View

C. Multi-Object Navigation with Given Subgoals

To assess the localization and navigation performance of IVLMap, we first conducted navigation experiments with given subgoals. For each of the four Matterport scenes used in these experiments, we curated multiple navigation tasks, each comprising four subgoals. The robot is instructed to navigate to the subgoals of each task in sequence. We use the robot's invocation of the "stop" function as the success criterion: if the robot calls "stop" and its distance to the target is less than a threshold (set to 1 meter in our case), navigation to that subgoal is considered successful. A navigation task is completed successfully when the robot reaches all four subgoals in sequence. Note that we provided the instance information of objects in the given subgoals in order to examine the effectiveness of the constructed IVLMap.
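The per-task bookkeeping described above can be sketched as follows. This assumes that a task's progress is the number of subgoals reached in order before the first failure, and that the count for subgoal k is the number of tasks reaching at least k subgoals in a row; both are interpretations made for this sketch, and the function names are hypothetical.

```python
def subgoals_completed_in_sequence(subgoal_results):
    """Count leading successes, since subgoals must be reached in order."""
    count = 0
    for reached in subgoal_results:
        if not reached:
            break
        count += 1
    return count


def summarize(tasks, num_subgoals=4):
    """Return ({k: tasks reaching at least k subgoals in a row}, task success rate)."""
    completed = [subgoals_completed_in_sequence(task) for task in tasks]
    counts = {
        k: sum(1 for c in completed if c >= k)
        for k in range(1, num_subgoals + 1)
    }
    task_success_rate = counts[num_subgoals] / len(tasks) if tasks else 0.0
    return counts, task_success_rate


# Four hypothetical tasks; each inner list records whether subgoals 1 to 4 were reached.
tasks = [
    [True, True, True, True],
    [True, True, False, True],
    [False, True, True, True],
    [True, True, True, False],
]
counts, tsr = summarize(tasks)
print(counts, tsr)
```

Under this reading, a late success after an earlier failure (as in the second task) does not count toward later subgoals, which matches the requirement that all four subgoals be reached in sequence for the task to succeed.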

TABLE I: OUTCOME OF MULTI-OBJECT NAVIGATION WITH SPECIFIED SUBGOALS, DENOTED BY SN FOR SUCCESS NUMBER, SR FOR SUCCESS RATE, T K FOR ACHIEVING THE KTH SUBGOAL OUT OF THE TOTAL 4 SUBGOALS IN EACH TASK, AND TSR FOR TASK SUCCESS RATE, NAMELY T 4.

Our observations (Table I) indicate that our approach outperforms all other baselines. It exhibits a slight improvement in navigation performance compared to VLMap, while significantly surpassing the performance of CoW and CLIP Map. VLMap achieves zero-shot navigation by precisely localizing landmarks. However, this baseline is limited in that it can only navigate to the instance of a category nearest to the robot agent, and lacks the capability for precise instance-level navigation. As illustrated by the comparison between Fig. 6(a) and Fig. 6(b), our proposed IVLMap incorporates instance information for each landmark, enabling precise instance-level navigation tasks. Consequently, the final navigation performance is significantly enhanced. During localization, our approach first identifies the approximate region of the landmark from the U and V matrices of IVLMap M (Sec. III-A). This region is then refined using VLMap, which improves on VLMap's own performance and results in a significant improvement in navigation accuracy.
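The coarse-then-refine localization described above can be sketched on a toy grid. This sketch assumes that U stores a per-cell instance index and V a per-cell color label, and it uses a binary category mask as a stand-in for the VLMap refinement; the exact definitions are given in Sec. III-A, so the data layout and function name here are hypothetical.

```python
def localize_instance(category_mask, instance_map, color_map, instance_id, color=None):
    """Return the centroid (row, col) of the requested instance, or None if absent."""
    # Step 1: coarse region from the instance (U) and color (V) layers.
    coarse = [
        (r, c)
        for r, row in enumerate(instance_map)
        for c, cell_id in enumerate(row)
        if cell_id == instance_id and (color is None or color_map[r][c] == color)
    ]
    # Step 2: refine by keeping only cells that the category mask also supports.
    refined = [(r, c) for r, c in coarse if category_mask[r][c]]
    if not refined:
        return None
    mean_row = sum(r for r, _ in refined) / len(refined)
    mean_col = sum(c for _, c in refined) / len(refined)
    return (mean_row, mean_col)


# Toy 3x4 map containing two sofas: instance 1 (yellow) and instance 2 (red).
sofa_mask = [
    [1, 1, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 1],
]
instance_ids = [
    [1, 1, 0, 0],
    [1, 1, 0, 2],
    [0, 0, 0, 2],
]
colors = [
    ["yellow", "yellow", "", ""],
    ["yellow", "yellow", "", "red"],
    ["", "", "", "red"],
]
second_sofa = localize_instance(sofa_mask, instance_ids, colors, instance_id=2, color="red")
print(second_sofa)
```

A category-only map would return whichever sofa is nearest to the robot; indexing by instance and color is what lets the query single out the second, red sofa.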

Fig. 6. Semantic segmentation results

TABLE II: OUTCOME OF ZERO-SHOT INSTANCE LEVEL OBJECT GOAL NAVIGATION FROM NATURAL LANGUAGE. T K FOR ACHIEVING THE KTH SUBGOAL OUT OF THE TOTAL 4 SUBGOALS IN EACH TASK.

D. Zero-Shot Instance Level Object Goal Navigation from Natural Language

In these experiments, we compare IVLMap against the baselines on zero-shot instance-level object goal navigation driven by natural language instructions. Our benchmark comprises 36 trajectories across four scenes, each accompanied by manually written language instructions for evaluation. In each instruction, we describe the instance and color of the navigation subgoals in natural language, for example "the first yellow sofa", "in between the chair and the sofa", or "east of the red table". Using the LLM, the robot agent extracts this information for localization and navigation. Each trajectory comprises four subgoals, and a subgoal counts as reached when the robot navigates to within a threshold distance of it (set to 1 m).
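To make the extracted information concrete, the sketch below shows one possible structured form for a subgoal phrase. The paper uses an LLM (Llama 2) for this step; the rule-based parser here is only a stand-in to illustrate the target structure, and the `Subgoal` fields, the vocabulary tables, and the function name are all hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Small illustrative vocabularies; an LLM would not need fixed lists like these.
ORDINALS = {"first": 1, "second": 2, "third": 3, "fourth": 4}
COLORS = {"red", "yellow", "green", "blue", "white", "black", "brown"}


@dataclass
class Subgoal:
    category: str
    instance: Optional[int] = None
    color: Optional[str] = None


def parse_object_phrase(phrase):
    """Split a phrase like 'the first yellow sofa' into category, instance, and color."""
    words = re.findall(r"[a-z]+", phrase.lower())
    instance = None
    color = None
    remaining = []
    for word in words:
        if word == "the":
            continue
        if word in ORDINALS and instance is None:
            instance = ORDINALS[word]
        elif word in COLORS and color is None:
            color = word
        else:
            remaining.append(word)
    return Subgoal(category=" ".join(remaining), instance=instance, color=color)


goal = parse_object_phrase("the first yellow sofa")
print(goal)
```

Spatial expressions such as "in between the chair and the sofa" involve relations between several objects and are beyond this simple parser, which is part of the motivation for delegating the parsing to an LLM.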

Fig. 7. Zero-shot navigation diagram, where the green dot represents the starting point and the red dot represents the endpoint.

Analysis of Table II reveals that in zero-shot instance-level object goal navigation, our navigation accuracy is hardly affected relative to the setting with given subgoals. We attribute this to the initial parsing of natural language instructions by the LLM, which enables precise extraction of physical attributes and thus robust navigation performance. Moreover, as depicted in the partial trajectory schematics of navigation tasks in Fig. 7, IVLMap achieves precise instance-level object navigation globally, a capability unmatched by the other baselines.

V. CONCLUSION

In this study, we introduce the Instance Level Visual Language Map (IVLMap), which improves navigation precision through instance-level and attribute-level semantic language instructions. Our approach is designed to enhance applicability in real-life scenarios and shows promising results in initial real-world robot applications. However, mapping performance in dynamic environments requires improvement, which motivates exploring real-time navigation using laser scanners. Our future goals include advancing towards 3D semantic maps to enable dynamic perception of object height, contributing to more accurate spatial navigation. Ongoing research efforts will focus on addressing these challenges.

ACKNOWLEDGMENT

The work is supported by the National Natural Science Foundation of China (no. 61601112). It is also supported by the Fundamental Research Funds for the Central Universities and DHU Distinguished Young Professor Program.

REFERENCES

[1] Z. Fu, T. Z. Zhao, and C. Finn, “Mobile aloha: Learning bimanual mobile manipulation with low-cost whole-body teleoperation,” in arXiv, 2024.

[2] P. Anderson, Q. Wu, D. Teney, J. Bruce, M. Johnson, N. Sünderhauf, I. Reid, S. Gould, and A. Van Den Hengel, “Vision-and-language navigation: Interpreting visually-grounded navigation instructions in real environments,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 3674–3683.

[3] J. Gu, E. Stefani, Q. Wu, J. Thomason, and X. E. Wang, “Vision-and-language navigation: A survey of tasks, methods, and future directions,” arXiv preprint arXiv:2203.12667, 2022.

[4] D. Shah, B. Osinski, S. Levine et al., “Lm-nav: Robotic navigation with large pre-trained models of language, vision, and action,” in Conference on Robot Learning. PMLR, 2023, pp. 492–504.

[5] C. Huang, O. Mees, A. Zeng, and W. Burgard, “Visual language maps for robot navigation,” in 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2023, pp. 10608–10615.

[6] Y. Qi, Q. Wu, P. Anderson, X. Wang, W. Y. Wang, C. Shen, and A. v. d. Hengel, “Reverie: Remote embodied visual referring expression in real indoor environments,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 9982–9991.

[7] F. Zhu, X. Liang, Y. Zhu, Q. Yu, X. Chang, and X. Liang, “Soon: Scenario oriented object navigation with graph-based exploration,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 12689–12699.

[8] Z. Wang, J. Li, Y. Hong, Y. Wang, Q. Wu, M. Bansal, S. Gould, H. Tan, and Y. Qiao, “Scaling data generation in vision-and-language navigation,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 12009–12020.

[9] K. He, Y. Huang, Q. Wu, J. Yang, D. An, S. Sima, and L. Wang, “Landmark-rxr: Solving vision-and-language navigation with fine-grained alignment supervision,” Advances in Neural Information Processing Systems, vol. 34, pp. 652–663, 2021.

[10] A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L. Gustafson, T. Xiao, S. Whitehead, A. C. Berg, W.-Y. Lo et al., “Segment anything,” arXiv preprint arXiv:2304.02643, 2023.

[11] K. Fukushima, “Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position,” Biological cybernetics, vol. 36, no. 4, pp. 193–202, 1980.

[12] P. Estefo, J. Simmonds, R. Robbes, and J. Fabry, “The robot operating system: Package reuse and community dynamics,” Journal of Systems and Software, vol. 151, pp. 226–242, 2019.

[13] R. F. Salas-Moreno, R. A. Newcombe, H. Strasdat, P. H. Kelly, and A. J. Davison, “Slam++: Simultaneous localisation and mapping at the level of objects,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2013, pp. 1352–1359.

[14] J. McCormac, R. Clark, M. Bloesch, A. Davison, and S. Leutenegger, “Fusion++: Volumetric object-level slam,” in 2018 international conference on 3D vision (3DV). IEEE, 2018, pp. 32–41.

[15] B. Chen, F. Xia, B. Ichter, K. Rao, K. Gopalakrishnan, M. S. Ryoo, A. Stone, and D. Kappler, “Open-vocabulary queryable scene representations for real world planning,” in 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2023, pp. 11509–11522.

[16] S.-H. Zhang, R. Li, X. Dong, P. Rosin, Z. Cai, X. Han, D. Yang, H. Huang, and S.-M. Hu, “Pose2seg: Detection free human instance segmentation,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 889–898.

[17] Y. Lee and J. Park, “Centermask: Real-time anchor-free instance segmentation,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 13906–13915.

[18] Y. Long, X. Li, W. Cai, and H. Dong, “Discuss before moving: Visual language navigation via multi-expert discussions,” arXiv preprint arXiv:2309.11382, 2023.

[19] Z. Jia, K. Yu, J. Ru, S. Yang, and S. Coleman, “Vital information matching in vision-and-language navigation,” Frontiers in Neurorobotics, vol. 16, p. 1035921, 2022.

[20] A. B. Vasudevan, D. Dai, and L. Van Gool, “Talk2nav: Long-range vision-and-language navigation with dual attention and spatial memory,” International Journal of Computer Vision, vol. 129, pp. 246–266, 2021.

[21] P. Chen, D. Ji, K. Lin, R. Zeng, T. Li, M. Tan, and C. Gan, “Weakly-supervised multi-granularity map learning for vision-and-language navigation,” Advances in Neural Information Processing Systems, vol. 35, pp. 38149–38161, 2022.

[22] J. Krantz, S. Banerjee, W. Zhu, J. Corso, P. Anderson, S. Lee, and J. Thomason, “Iterative vision-and-language navigation,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 14921–14930.

[23] G. Zhou, Y. Hong, and Q. Wu, “Navgpt: Explicit reasoning in vision-and-language navigation with large language models,” arXiv preprint arXiv:2305.16986, 2023.

[24] R. Schumann, W. Zhu, W. Feng, T.-J. Fu, S. Riezler, and W. Y. Wang, “Velma: Verbalization embodiment of llm agents for vision and language navigation in street view,” arXiv preprint arXiv:2307.06082, 2023.

[25] S. Vemprala, R. Bonatti, A. Bucker, and A. Kapoor, “Chatgpt for robotics: Design principles and model abilities,” Published by Microsoft, 2023.

[26] D. Driess, F. Xia, M. S. Sajjadi, C. Lynch, A. Chowdhery, B. Ichter, A. Wahid, J. Tompson, Q. Vuong, T. Yu et al., “Palm-e: An embodied multimodal language model,” arXiv preprint arXiv:2303.03378, 2023.

[27] B. Li, K. Weinberger, S. Belongie, V. Koltun, and R. Ranftl, “Language-driven semantic segmentation,” 2023.

[28] P. P. Ray, “Chatgpt: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope,” Internet of Things and Cyber-Physical Systems, 2023.

[29] H. Touvron, L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, N. Bashlykov, S. Batra, P. Bhargava, S. Bhosale et al., “Llama 2: Open foundation and fine-tuned chat models,” arXiv preprint arXiv:2307.09288, 2023.

[30] S. Y. Gadre, M. Wortsman, G. Ilharco, L. Schmidt, and S. Song, “Clip on wheels: Zero-shot object navigation as object localization and exploration,” arXiv preprint arXiv:2203.10421, 2022.

[31] M. Ahn, A. Brohan, N. Brown, Y. Chebotar, O. Cortes, B. David, C. Finn, C. Fu, K. Gopalakrishnan, K. Hausman et al., “Do as i can, not as i say: Grounding language in robotic affordances,” arXiv preprint arXiv:2204.01691, 2022.

[32] A. Zeng, M. Attarian, B. Ichter, K. Choromanski, A. Wong, S. Welker, F. Tombari, A. Purohit, M. Ryoo, V. Sindhwani et al., “Socratic models: Composing zero-shot multimodal reasoning with language,” arXiv preprint arXiv:2204.00598, 2022.

[33] J. Liang, W. Huang, F. Xia, P. Xu, K. Hausman, B. Ichter, P. Florence, and A. Zeng, “Code as policies: Language model programs for embodied control,” in 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2023, pp. 9493–9500.

[34] E. Frantar, S. Ashkboos, T. Hoefler, and D. Alistarh, “Gptq: Accurate post-training quantization for generative pre-trained transformers,” arXiv preprint arXiv:2210.17323, 2022.

[35] M. Savva, A. Kadian, O. Maksymets, Y. Zhao, E. Wijmans, B. Jain, J. Straub, J. Liu, V. Koltun, J. Malik et al., “Habitat: A platform for embodied ai research,” in Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 9339–9347.

[36] R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, “Grad-cam: Visual explanations from deep networks via gradient-based localization,” in Proceedings of the IEEE international conference on computer vision, 2017, pp. 618–626.

[37] R. Schumann and S. Riezler, “Analyzing generalization of vision and language navigation to unseen outdoor areas,” arXiv preprint arXiv:2203.13838, 2022.

[38] A. Dai, A. X. Chang, M. Savva, M. Halber, T. Funkhouser, and M. Nießner, “Scannet: Richly-annotated 3d reconstructions of indoor scenes,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 5828–5839.

[39] A. Geiger, P. Lenz, C. Stiller, and R. Urtasun, “Vision meets robotics: The kitti dataset,” The International Journal of Robotics Research, vol. 32, no. 11, pp. 1231–1237, 2013.

[40] C. Cao, H. Zhu, F. Yang, Y. Xia, H. Choset, J. Oh, and J. Zhang, “Autonomous exploration development environment and the planning algorithms,” in 2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022, pp. 8921–8928.


:::info Authors:

(1) Jiacui Huang, Senior, IEEE;

(2) Hongtao Zhang, Senior, IEEE;

(3) Mingbo Zhao, Senior, IEEE;

(4) Wu Zhou, Senior, IEEE.

:::


:::info This paper is available on arxiv under CC by 4.0 Deed (Attribution 4.0 International) license.

:::

[4] Socket.IO is a real-time communication protocol built on WebSocket that provides event-driven, bidirectional communication for integrating interactive features into web applications. Official website: https://socket.io/.
