论文标题
改头换面:一个透明的深度学习框架,以美化城市场景
FaceLift: A transparent deep learning framework to beautify urban scenes
论文作者
论文摘要
在计算机视觉领域,深度学习技术最近被用来预测城市场景是否可能被认为是美丽的:事实证明,这些技术能够做出准确的预测。然而,当涉及到城市设计的可行见解时,它们缺乏。为了支持城市干预措施,人们需要超越预测美丽,并应对重现美的挑战。不幸的是,深度学习技术尚未考虑到这一挑战。鉴于它们的“黑盒本性”,这些模型不能直接用于解释为什么特定的城市场景被认为是美丽的。为了部分解决这个问题,我们提出了一个名为“ Facelift”的深度学习框架,该框架能够美化现有的城市场景(Google Street Views)并解释哪些城市元素使这些转变的场景变得美丽。为了定量评估我们的框架,我们无法求助于任何现有的指标(因为手头的研究问题从未解决过),并且需要制定新的指标。这些新的指标理想情况下应捕捉到使城市空间伟大的元素的存在/不存在。审查城市规划文献后,我们确定了五个主要指标:步行性,绿色空间,开放性,地标和视觉复杂性。我们发现,在所有五个指标中,美化的场景都符合文献对巨大空间所构成的期望。一项20个参与者的专家调查进一步证实了这一结果,在该调查中,人们发现改头换面有效地促进公民参与。所有这些都表明,将来,随着我们框架的组件得到进一步研究,变得更好,变得更加复杂,不难想象的技术将能够准确有效地支持建筑师和规划师在我们直观地爱上的空间设计中。
In the area of computer vision, deep learning techniques have recently been used to predict whether urban scenes are likely to be considered beautiful: it turns out that these techniques are able to make accurate predictions. Yet they fall short when it comes to generating actionable insights for urban design. To support urban interventions, one needs to go beyond predicting beauty, and tackle the challenge of recreating beauty. Unfortunately, deep learning techniques have not been designed with that challenge in mind. Given their "black-box nature", these models cannot be directly used to explain why a particular urban scene is deemed to be beautiful. To partly fix that, we propose a deep learning framework called Facelift, that is able to both beautify existing urban scenes (Google Street views) and explain which urban elements make those transformed scenes beautiful. To quantitatively evaluate our framework, we cannot resort to any existing metric (as the research problem at hand has never been tackled before) and need to formulate new ones. These new metrics should ideally capture the presence/absence of elements that make urban spaces great. Upon a review of the urban planning literature, we identify five main metrics: walkability, green spaces, openness, landmarks and visual complexity. We find that, across all the five metrics, the beautified scenes meet the expectations set by the literature on what great spaces tend to be made of. This result is further confirmed by a 20-participant expert survey in which FaceLift have been found to be effective in promoting citizen participation. All this suggests that, in the future, as our framework's components are further researched and become better and more sophisticated, it is not hard to imagine technologies that will be able to accurately and efficiently support architects and planners in the design of spaces we intuitively love.