


AIGC has a new magic!
No need for animators' hand K, habit capture or light capture, just provide a video, this AI motion capture software can automatically output the action. In just a few minutes, the animation of the virtual human is completed.
Not only the large-frame movements of the limbs, but also the details of the hands can be accurately captured.
In addition to single-view video, it can also support multiple-view videos. Compared with other motion capture software that only supports monocular recognition, this software can provide higher motion capture quality.
At the same time, the software also supports editing and modification of recognized human body key points, smoothness, footstep details, etc. It can satisfy everything from the interest experience of ordinary players to the professional needs of hardcore players.
This is AIxPose, which has been developed by NetEase Interactive Entertainment AI Lab for many years, iteratively optimized and low-key based on professional art feedback. Video motion capture software. It is reported that the software has processed more than dozens of hours of video resources and has been used in the production process of game plot animations, popular dance animations and other resources. It has been verified by actual projects that a 1-minute dance animation may take more than 20 days to be produced by hand, but it only takes 3 days to produce with AIxPose assistance, and the entire process is shortened by more than 80%.
Recently, NetEase Interactive Entertainment AI Lab compiled the paper "Learning Analytical Posterior Probability" based on its experience in developing this software and related research work in the field of motion capture. for Human Mesh Recovery" was accepted by CVPR 2023, the top computer vision conference.
- Home page address: https://netease-gameai.github.io/ProPose/
- Paper address: https://netease-gameai.github.io/ProPose/static/assets/CVPR2023_ProPose.pdf
This paper innovatively proposes ProPose, a video motion capture technology based on posterior probability, which can achieve accurate three-dimensional human pose estimation under different settings such as single image and multi-sensor fusion. Technical accuracy is 19% higher than baseline probabilistic methods using priors, and outperforms past methods on the public datasets 3DPW, Human3.6M, and AGORA. In addition, for multi-sensor fusion tasks, this technology can also achieve higher accuracy than the baseline model without modifying the backbone of the neural network due to the introduction of new sensors.
Technical Background
The task of this research is to predict human mesh recovery (hmr) from RGB images. The existing methods can be summarized into two Category: direct method and indirect method. The direct method uses a neural network to regress the rotational representation of human joints end-to-end (such as axis angle, rotation matrix, 6D vector, etc.), while the indirect method first predicts some intermediate representations (such as three-dimensional key points, segmentation, etc.), and then passes these intermediate Indicates that the joint rotation is obtained.
However, both types of methods have some problems. For direct methods, since this type of method requires the network to directly learn abstract representations such as rotation, compared with learning key points and segmentation, learning rotation is relatively difficult, so the results output by the network are sometimes difficult to align with the image and cannot be completed. Some large movements, such as the right foot in the first row in the picture below (a) cannot be fully extended back. In contrast, indirect methods generally produce higher accuracy, but the performance of such methods relies heavily on the accuracy of the intermediate representation. When the intermediate representation produces errors due to noise, it is easy for the final rotation to appear quite obvious. error, as shown in the left hand side of the second line in (b) below.
In addition to the aforementioned deterministic methods, there are also some methods to model the uncertainty of human posture by learning certain probability distributions, thereby Take noise into account to improve system robustness. Currently, the main probability modeling methods include multivariate Gaussian distribution, normalized flow, neural network implicit modeling, etc., but these probability distributions on non-SO (3) cannot truly reflect the uncertainty of joint rotation. For example, when the uncertainty is large, the local linearity assumption of the Gaussian distribution on SO (3) does not hold. A recent work directly uses the network to learn the parameters of the matrix Fisher distribution. Although this is a distribution on SO (3), the learning method of this method is similar to the direct method, and the convergence performance cannot be compared with the existing indirect method. .
In order to take into account both high accuracy and robustness and improve the performance of probabilistic methods, ProPose derives the analytical posterior probability of joint rotation, which can not only benefit from the changes brought by different observation variables With high accuracy, it can also measure uncertainty and reduce the impact of noise on the algorithm as much as possible. As shown in the figure below, for the input image, ProPose can measure the uncertainty of the joint rotation in various directions to a certain extent through the output probability distribution, such as the rotation of the right hand along the arm axis, the direction of the left arm swinging up and down, and the left calf. The degree of distance, etc.
Technical implementation
Human body modeling
##This study conducts probability construction of human posture module, the goal is to find the posterior probability p (R|d,⋯) of joint rotation R under some observed variables (such as bone orientation d, etc.).
Specifically, since the joint rotation of the human body is located on SO (3), and the unit bone orientation of the child joint relative to the parent joint is located on S^2, it can be based on these two Analyze the probability distribution on a manifold.
First of all, the matrix Fisher distribution MF (⋅) on SO (3) can be used as the prior distribution of the joint rotation R, as shown in the following formula, F∈R^(3×3 ) are the parameters of the distribution, c (F) is a normalizing constant, and tr represents the trace of the matrix.
As shown in the following formula, F can be directly solved for the mean M and an aggregation term that represents the degree of distribution aggregation through SVD decomposition K. Among them, Δ=diag (1,1,|UV|) is a diagonal orthogonal matrix, which is used to ensure that the determinant of M is 1, so that it can fall in the special orthogonal group.
Using Bayesian theory, given the prior distribution p (R) and the likelihood function p (d|R), the posterior distribution of the joint rotation conditional on the bone orientation can be calculated. The analytical form of the posterior probability p (R|d):
The above posterior probability only considers the orientation of the human skeleton as an observation quantity. Similarly, it can also be extended to other direction observation quantities d_i or rotation observation quantities D_j (which can be generated by other sensors) , such as IMUs, etc.), the analytical posterior probability is obtained in the following general form:
where κ_i and K_j are aggregation terms. g (⋅) is a mapping in the form of IK, which can convert direction observations into rotation estimates. It can adopt the simplest form such as g (d_i)=dl^T. Z_1 and Z_3 represent the set of direction observations and rotation observations respectively. Characteristics This section further explains that the posterior probability distribution has a higher probability than the prior probability distribution. degree of aggregation. The foregoing section introduces the analytical form of the posterior probability of human joint rotation, which is characterized by a new parameter F'. The posterior parameter F^' can be understood from another perspective, that is, F^' is the product of the mean term M that is the same as F and a new aggregation term K^': Where M^T dl^T=ll^T is a rank 1 real symmetric matrix, and K is also a real symmetric matrix, that is, the posterior aggregation term K' is also a real symmetric matrix. According to the staggered theorem about real symmetric matrices in matrix analysis, it can be obtained that the eigenvalues λ_i' of K' and the eigenvalues λ_i of K have the following inequality relationship: Considering that the eigenvalue of the aggregation term is equivalent to the singular value of the distribution parameter, and the singular value of the distribution parameter can reflect the confidence of the distribution, it can be concluded that when the likelihood term is non-zero, the posterior estimation ratio The prior estimate is more concentrated and can quickly converge to the mode preferred by the likelihood function, making it easier to learn. In addition to the prior probability method, another major benchmark method is to use inverse kinematics (IK) to directly calculate the rotation through the bone orientation. The following picture can intuitively show the posterior Comparison between probabilistic and deterministic IK methods. The above picture takes the human elbow joint as an example. The real three-dimensional coordinate axis represents the true value, and the transparent three-dimensional coordinate axis represents the estimated value. The first line represents the deterministic IK method. The modeling method behind this type of method is a vector representing the bone orientation. When the bone orientation is accurately estimated, the remaining one degree of freedom (twist) can be reduced to a circle (in the figure The dotted circle on the ball); when the bone orientation is estimated inaccurately, it will cause all possible estimates to deviate from the true value. The second line represents the posterior probability model of this study, which is a fusion of multiple different types of models. The red area on the sphere represents the probability of a certain rotation. Even if there is an error in the estimation of the bone orientation, this method may return it to the true state. value, because the noise of bone orientation can be mitigated as much as possible by a priori or other observations. Network framework diagram and loss function Based on the aforementioned theory and derivation, the following figure can be directly constructed frame diagram. A multi-branch network is used to estimate the prior distribution parameter F, the three-dimensional key point J (from which the bone orientation d is calculated), and the shape parameter β from a single image. The posterior probability is calculated through Bayes' rule, and finally the posture estimate can be obtained from the posterior distribution to output the human mesh. The selection of the loss function is relatively straightforward and is the weighted sum of the following four constraints, where L_J represents the key point constraint and L_β represents the shape parameter constraint. L_θ represents the attitude parameter constraint in matrix form, and L_s represents the attitude constraint after sampling the distribution. Regarding the constraints on the distribution, MAP is not used directly here because the numerical stability of the normalization parameters is considered. Regarding the sampling strategy, similar to the previous work, the matrix Fisher distribution is converted into the equivalent Bingham distribution in the quaternion form, and then obtained through rejection sampling, where the recommended distribution for rejection sampling adopts the angular central Gaussian distribution. In the experimental part, this study conducted a quantitative comparison with past methods on the public data sets Human3.6M, 3DPW, AGORA, and TotalCapture. It can be seen that the method of this study surpasses many previous methods. The last two gray rows in the table on the lower right are the work of the same period, and are listed here for the completeness of the list. Experimental results
#The following figure shows the existing SOTA Qualitative comparison of methods HybrIK, PARE, and CLIFF shows that ProPose can achieve better results in some occlusion situations.
The following table shows a series of ablation experiments, mainly demonstrating the accuracy and robustness of ProPose. The benchmark methods include not using three-dimensional key points, not using priors, not using priors during testing, selecting features at different locations in the backbone network, etc. The table on the left below fully verifies that the proposed posterior probability distribution has higher accuracy. The table on the right below shows the comparison of the robustness to noise between the posterior method and the deterministic IK method. It can be seen that the posterior method can resist the interference of noise to a greater extent.
In addition to the above hmr tasks, this research also focuses on multi-sensor fusion tasks The evaluation was carried out on the above, and the effect of a single view and IMUs fusion is given below.
The above is the detailed content of Animation production efficiency increased by 80%! This AI software realizes high-precision video motion capture with one click. For more information, please follow other related articles on the PHP Chinese website!

ai合并图层的快捷键是“Ctrl+Shift+E”,它的作用是把目前所有处在显示状态的图层合并,在隐藏状态的图层则不作变动。也可以选中要合并的图层,在菜单栏中依次点击“窗口”-“路径查找器”,点击“合并”按钮。

ai橡皮擦擦不掉东西是因为AI是矢量图软件,用橡皮擦不能擦位图的,其解决办法就是用蒙板工具以及钢笔勾好路径再建立蒙板即可实现擦掉东西。

虽然谷歌早在2020年,就在自家的数据中心上部署了当时最强的AI芯片——TPU v4。但直到今年的4月4日,谷歌才首次公布了这台AI超算的技术细节。论文地址:https://arxiv.org/abs/2304.01433相比于TPU v3,TPU v4的性能要高出2.1倍,而在整合4096个芯片之后,超算的性能更是提升了10倍。另外,谷歌还声称,自家芯片要比英伟达A100更快、更节能。与A100对打,速度快1.7倍论文中,谷歌表示,对于规模相当的系统,TPU v4可以提供比英伟达A100强1.

ai可以转成psd格式。转换方法:1、打开Adobe Illustrator软件,依次点击顶部菜单栏的“文件”-“打开”,选择所需的ai文件;2、点击右侧功能面板中的“图层”,点击三杠图标,在弹出的选项中选择“释放到图层(顺序)”;3、依次点击顶部菜单栏的“文件”-“导出”-“导出为”;4、在弹出的“导出”对话框中,将“保存类型”设置为“PSD格式”,点击“导出”即可;

Yann LeCun 这个观点的确有些大胆。 「从现在起 5 年内,没有哪个头脑正常的人会使用自回归模型。」最近,图灵奖得主 Yann LeCun 给一场辩论做了个特别的开场。而他口中的自回归,正是当前爆红的 GPT 家族模型所依赖的学习范式。当然,被 Yann LeCun 指出问题的不只是自回归模型。在他看来,当前整个的机器学习领域都面临巨大挑战。这场辩论的主题为「Do large language models need sensory grounding for meaning and u

ai顶部属性栏不见了的解决办法:1、开启Ai新建画布,进入绘图页面;2、在Ai顶部菜单栏中点击“窗口”;3、在系统弹出的窗口菜单页面中点击“控制”,然后开启“控制”窗口即可显示出属性栏。

引入密集强化学习,用 AI 验证 AI。 自动驾驶汽车 (AV) 技术的快速发展,使得我们正处于交通革命的风口浪尖,其规模是自一个世纪前汽车问世以来从未见过的。自动驾驶技术具有显着提高交通安全性、机动性和可持续性的潜力,因此引起了工业界、政府机构、专业组织和学术机构的共同关注。过去 20 年里,自动驾驶汽车的发展取得了长足的进步,尤其是随着深度学习的出现更是如此。到 2015 年,开始有公司宣布他们将在 2020 之前量产 AV。不过到目前为止,并且没有 level 4 级别的 AV 可以在市场

ai移动不了东西的解决办法:1、打开ai软件,打开空白文档;2、选择矩形工具,在文档中绘制矩形;3、点击选择工具,移动文档中的矩形;4、点击图层按钮,弹出图层面板对话框,解锁图层;5、点击选择工具,移动矩形即可。


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

SublimeText3 Linux new version
SublimeText3 Linux latest version

SublimeText3 English version
Recommended: Win version, supports code prompts!
