Measuring and predicting Carbon Emission (CE) is important to enabling the main culprit of various urgent environmental issues including global warming. However, prior studies did not fully incorporate the impact of micro-level urban streetscapes, which might lead to biased prediction of urban CE. To fill the gap, we developed an effective framework to predict residential CE in urban areas from widely existing and publicly available street-view images (SVI) using machine learning. First, we used a semantic segmentation algorithm to classify more than 30 streetscape elements from SVI images to describe the built environment whose features might affect residential and transportation CE. Second, based on the streetscapes quantified, we trained a 10-fold cross-validation method with various machine learning models to predict the CE at the 1KM grid level using CE data from the PlanetData. We found first, built environment features such as sidewalks, roads, fences, buildings, and walls are significantly correlated with the residential CE. Second, the presence of buildings and subtle streetscape features (e.g., walls, fences) indicates higher-density residential areas which are related to more residential CE. Third, vegetation (e.g., trees and grass) are reversely related to residential CE. Our findings shed light on the feasibility of using a single and open data source (i.e., the SVI) to effectively model neighborhood-level CE for regions across diverse urban forms. Our framework is useful for urban planners to inform new town development and urban regeneration towards the low CE goals.