Submitted:
02 August 2025
Posted:
04 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Background
2.1. 3D Object Generation
2.2. 3D Scene Generation and Editing
3. Method
- Primitives Stylizer, which takes a single-view image of a primitive and produces a stylized version guided by a text prompt;
- Mesh Generator, which converts the stylized image into a corresponding textured 3D mesh;
- Scene Integrator, which incorporates the generated mesh into the target environment.
3.1. Primitives Stylizer
3.2. Mesh Generator
3.3. Scene Integrator

4. Experiments
4.1. Evaluation Setup
4.2. Results
| Object | Primitive | Mesh | SIGNeRF (min) | Total (min) |
| -Stylization (s) | Generation (s) | |||
| Sofa | 16.7 | 30 | 28.3 | 29.1 |
| Lamp | 18.1 | 30 | 29.1 | 29.9 |
| Bed | 15.3 | 30 | 30.2 | 31.0 |
5. Conclusion
References
- K. Gao, Y. Gao, H. He, D. Lu, L. Xu, and J. Li, “Nerf: Neural radiance field in 3d vision, a comprehensive review,”. arXiv preprint, arXiv:2210.00379.2022.
- B. Kerbl, G. Kopanas, T. Leimkühler, and G. Drettakis, “3d gaussian splatting for real-time radiance field rendering,” ACM Transactions on Graphics, vol. 42, no. 4, pp. 1–14, 2023.
- Z. Wang, Y. Wang, Y. Chen, C. Xiang, S. Chen, D. Yu, C. Li, H. Su, and J. Zhu, “Crm: Single image to 3d textured mesh with convolutional reconstruction model,”. arXiv preprint, arXiv:2403.05034.2024.
- J. R. Shue, E. R. Chan, R. Po, Z. Ankner, J. Wu, and G. Wetzstein, “3d neural field generation using triplane diffusion,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 20 875–20 886.
- Y. Xu, Z. Shi, W. Yifan, H. Chen, C. Yang, S. Peng, Y. Shen, and G. Wetzstein, “Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation,”. arXiv preprint, arXiv:2403.14621.2024.
- Y.-J. Yuan, Y.-T. Sun, Y.-K. Lai, Y. Ma, R. Jia, and L. Gao, “Nerf-editing: geometry editing of neural radiance fields,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 18 353–18 364.
- B. Yang, C. Bao, J. Zeng, H. Bao, Y. Zhang, Z. Cui, and G. Zhang, “Neumesh: Learning disentangled neural mesh-based implicit field for geometry and texture editing,” in European Conference on Computer Vision. Springer, 2022, pp. 597–614.
- D. Cohen-Bar, E. Richardson, G. Metzer, R. Giryes, and D. Cohen-Or, “Set-the-scene: Global-local training for generating controllable nerf scenes,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 2920–2929.
- R. Po and G. Wetzstein, “Compositional 3d scene generation using locally conditioned diffusion,”. arXiv preprint, arXiv:2303.12218.2023.
- A. Haque, M. Tancik, A. A. Efros, A. Holynski, and A. Kanazawa, “Instruct-nerf2nerf: Editing 3d scenes with instructions,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 19 740–19 750.
- T. Brooks, A. Holynski, and A. A. Efros, “Instructpix2pix: Learning to follow image editing instructions,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 18 392–18 402.
- J.-N. Dihlmann, A. Engelhardt, and H. Lensch, “Signerf: Scene integrated generation for neural radiance fields,”. arXiv preprint, arXiv:2401.01647.2024.
- L. Zhang, A. Rao, and M. Agrawala, “Adding conditional control to text-to-image diffusion models,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 3836–3847.
- M. Tancik, E. Weber, E. Ng, R. Li, B. Yi, J. Kerr, T. Wang, A. Kristoffersen, J. Austin, K. Salahi, A. Ahuja, D. McAllister, and A. Kanazawa, “Nerfstudio: A modular framework for neural radiance field development,” in ACM SIGGRAPH 2023 Conference Proceedings, ser. SIGGRAPH ’23, 2023.






Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).