首页 \ 问答 \ 在大图像中找到小的部分透明图像的坐标(find coordinates of small partially-transparent image within a large image)

在大图像中找到小的部分透明图像的坐标(find coordinates of small partially-transparent image within a large image)


我的目标是在更大的信息图像中找到小corner images的坐标。 使用这些坐标,我想确定特定的“感兴趣区域”,以便我能够在呈现图像时聚焦这些区域或从图像中剪切这些区域。

描述的corner images看起来像这些(请忽略蓝色数字,因为它们只是注释。我想使用透明的灰色PNG图形):

这些corner images放置在大图像中的某些位置:
每组corner icons定义“感兴趣区域”(如何确定多个左上角中的哪一个属于哪个区域将是另一个问题)。

我想将一个免费/开源库与命令行操作或Python接口一起使用。 似乎ImageMagick库看起来非常接近可能的解决方案。 但是如果解决这个问题,任何其他技术都可以。

使用例如ImageMagick库,我想顺序检查corner items (它们只是同一原始图像的90度旋转),并将它们与大图像“比较”。 在大图像中, corner items可能“非常接近”其他图形内容。 因此,简单的逐块身份测试可能会失败。 “透明度”应被视为“不关心”。


I am looking for a way to programmatically (without using any graphical user interface) find the coordinates of a small image within a large image.

My aim is to find the coordinates of small corner images within a larger information image. With these coordinates I want to determine specific "regions of interest" so that I would be able to focus on these regions while presenting the image or cut these regions from the image.

The described corner images could look like these (please ignore the blue numbers, as they are only comments. I'd like to use gray PNG graphics with transparency):
enter image description here
1 top left corner
2 bottom left corner
3 bottom right corner
4 top right corner

These corner images are placed at certain positions within the large image:
enter image description here
Each set of corner icons defines a "region of interest" (how to determine which of the multiple top left corners belongs to which region would be another issue, though).

I'd like to use a free/open source library together with command line operation or a Python interface. It seems that the ImageMagick library looks pretty close to a possible solution. But any other technology would be fine if it solves this problem.

With e.g. the ImageMagick library I would like to check the four corner items sequentially (they are just 90 degree rotations of the same original image) and "compare" each of them with the large image. Within the large image, it would be more than possible that the corner items reside "very near" to other graphical content. So, a simple block-wise identity testing would possibly fail. "Transparency" should be considered as "don't care".

What tool and process could I use to get the absolute x- and y-positions of such a small image within a larger one?

更新时间:2020-05-09 10:20




  • jsPlumb draggable element javascript函数(jsPlumb draggable element javascript function)
  • MVC4:ViewModel(带有radiobuttonlist)在HttpPost之后为空(MVC4: ViewModel (with radiobuttonlist) is empty after HttpPost)
  • 如何在同一帐户上设置“Dev repo”(在prod和团队之间)(How to set up a “Dev repo” (between the prod and the team) on the same account)
  • 如何在tcl中将eth0配置为发送方udp端口(how to configure eth0 as a sender udp port in tcl)
  • 如何在datarow []中的列中找到最大值?(How to find max value in a column in a datarow[] ?)
  • 如何使用预定义文本替换来自数据库的部分结果(How do I replace part of result coming from Database with predefined text)
  • Selenium Java注入了新的Javascript函数(Selenium Java inject new Javascript function)
  • 使用.on的多个下拉菜单选择文本仅适用于第一个下拉列表(Multiple Dropdowns Menu Selection text using .on works only on first dropdown)
  • 快速将黄土曲线添加到大型数据集图中的方法(Quick way to add loess curve to large data set graph)
  • FilteringSelect in mvc(FilteringSelect in mvc)
  • 在Delphi XE2中开发Mac或iOS应用程序需要哪些硬件/软件?(What hardware/software is necessary to develop Mac or iOS apps in Delphi XE2?)
  • 在原型的构造函数中初始化属性时获取“未定义”(Getting 'undefined' when a property is initialized in the constructor of a prototype)
  • 通过越狱加载的应用程序的Documents文件夹位置(Location of Documents folder for an app loaded via jailbreak)
  • 在OpenGL中使用可编程和固定管道功能(Using both programmable and fixed pipeline functionality in OpenGL)
  • 将任何用户输入重定向到单独的底层程序(redirect any user input to a separate underlying program)
  • 编辑文本不能正常工作android(Edit texts not working properly android)
  • “user_denied”Facebook应用页面上的Facebook用户区域设置(Facebook user locale on “user_denied” facebook app page)
  • 在大图像中找到小的部分透明图像的坐标(find coordinates of small partially-transparent image within a large image)
  • 我如何在cakephp 3.1中获得完整的相对路径?(How i can get full relative path of image in cakephp 3.1?)
  • 如何保存拖动标记的新本地化?(How to save new localization of dragged marker?)
  • 在执行查询之前,在SQLAlchemy模型中将datetime转换为unix时间戳?(Convert datetime to unix timestamp in SQLAlchemy model before executing query?)
  • OpenCL与OpenGL互操作的优势(Advantage of OpenCL interoperability with OpenGL)
  • 如何解析用点和等分隔的数据然后添加到listview(How to parsing data from delimited with dot and equal then add to listview)
  • 带调试输出的X3解析器段错误(BOOST_SPIRIT_X3_DEBUG)(X3 parser segfaults with debug output (BOOST_SPIRIT_X3_DEBUG))
  • 将文件夹名称添加到fgrep结果(Add folder name to fgrep result)
  • 在MySQL中加载一个表是非常慢的(Loading one table in MySQL is ridiculously slow)
  • 如何将JSON放入PHP变量?(How do I put JSON into a PHP Variable?)
  • 如何绕过Microsoft.Speech.Recognition中的不流畅?(How to bypass disfluencies in Microsoft.Speech.Recognition?)
  • 原点的最后一行是什么?(What is the last row of an origin for?)