Design

Our new public speech synthesis in super-high quality, 10x faster and more stable

Machine learning *Natural Language Processing *Voice user interfaces


In our last article we made a bunch of promises about our speech synthesis.


After a lot of hard work, we have finally delivered on these promises:


  • Model size reduced 2x;
  • New models are 10x faster;
  • We added flags to control stress;
  • Now the models can make proper pauses;
  • High quality voice added (and unlimited "random" voices);
  • All speakers squeezed into the same model;
  • Input length limitations lifted, now models can work with paragraphs of text;
  • Pauses, speed and pitch can be controlled via SSML;
  • Sampling rates of 8, 24 or 48 kHz are supported;
  • Models are much more stable: they no longer omit words.
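
The SSML control mentioned in the list above can be sketched as follows. This is only a minimal illustration, assuming the synthesizer accepts the standard SSML elements `<speak>`, `<prosody>`, `<s>` and `<break>`; `build_ssml` and its parameters are hypothetical names introduced here, not part of the models' API.

```python
import xml.etree.ElementTree as ET


def build_ssml(sentences, pause_ms=300, rate="medium", pitch="medium"):
    """Build an SSML string with one shared prosody setting.

    Hypothetical helper: wraps each sentence in <s> and adds a
    <break> of `pause_ms` milliseconds after it. Which tags and
    attribute values a given synthesizer honors is an assumption.
    """
    speak = ET.Element("speak")
    # Speed and pitch are set once for the whole utterance.
    prosody = ET.SubElement(speak, "prosody", {"rate": rate, "pitch": pitch})
    for sentence in sentences:
        s = ET.SubElement(prosody, "s")
        s.text = sentence  # ElementTree escapes &, < and > on output
        # Explicit pause after every sentence.
        ET.SubElement(prosody, "break", {"time": f"{pause_ms}ms"})
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml(["Hello.", "How are you?"], pause_ms=500, rate="fast"))
```

Building the markup with an XML library rather than string concatenation keeps the input text correctly escaped, which matters once whole paragraphs are passed in.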

This is a true breakthrough for us, and we are not planning to stop anytime soon. We will shortly be adding as many languages as possible (the CIS languages, English, European languages, Indic languages). We are also still planning to make our models an additional 2-5x faster.


We are also planning to add phonemes and a new model for stress, as well as to reduce the minimum amount of audio required to train a high-quality voice to 5 to 15 minutes.


As usual, you can try our model in our repo or in Colab.

Read more →
Total votes 13: ↑13 and ↓0 +13
Views 922
Comments 0

Analysis of UE5 Rendering Technology: Nanite

Game development *Game testing *Game design *AR and VR Games and game consoles
Tutorial

Ever since Epic released the UE5 technology demo at the beginning of 2021, discussion of UE5 has not stopped. The technical discussion has centered mainly on two new features: the global illumination technology Lumen and the extremely high model detail technology Nanite. Some articles [1][2] have already analyzed Nanite in more detail. This article starts from a RenderDoc analysis and the UE5 source code, combines them with existing technical material, and aims to give an intuitive overview of Nanite and to clarify its algorithmic principles and design ideas without going into too many source-level implementation details.

https://blog.en.uwa4d.com/2022/02
Rating 0
Views 1.4K
Comments 0

UE5 Lumen Implementation Analysis

Game development *Game testing *Game design *AR and VR Games and game consoles
Tutorial

Lumen is UE5's GI system. Unlike traditional real-time GI, which includes only the contribution of indirect diffuse reflection, Lumen covers both indirect diffuse reflection and indirect specular highlights, providing a complete set of indirect lighting. Lumen supports both hardware-based RTX tracing and software-based tracing algorithms. This article analyzes the process, algorithms, and data structures of the indirect diffuse part of Lumen GI under software tracing, in order to explain Lumen's basic principles and operating mechanism from a macro perspective.

The core of Lumen includes the following parts:

Read more https://blog.en.uwa4d.com/2022/0
Rating 0
Views 839
Comments 0

Unity Performance Optimization Ⅵ: Resource Memory Leak

Game development *Game testing *Game design *AR and VR Games and game consoles
Sandbox

Today we will share some points about resource memory leaks. Memory leaks are among the most common problems we see, and among the most feared. Why? Because until the source of a leak is located, we cannot predict its extent, so we have no idea whether it will blow up at some moment in production. Developers have told us that their players had no problems for the first half hour of play, but that the game became more and more sluggish after 3 to 4 hours, which they had not anticipated. How can this be solved? This article answers that question.

UWA’s GOT Online-Assets report has a resource occupancy trend chart. If there is a rising trend like the one below, you must pay special attention.

Read more at blog dot en dot uwa4d dot com
Total votes 1: ↑0 and ↓1 -1
Views 866
Comments 0

Japanese design is more than minimalism

Alconost corporate blog Web design *Language localisation *Graphic design *Reading room
Translation

The problem I face most often as a presentation designer is having to squeeze a large amount of information into as few slides as possible (that is what the client wants) while still arranging it in an aesthetically pleasing way.

"This is impossible," I think, shuffling text blocks around like a Tetris player. But then it suddenly occurs to me that this is exactly what the Japanese excel at. Just look at their menus, their magazines and brochures, or even the streets of Japanese cities.

And yet, when I google the phrase "Japanese design", all the search engine gives me is articles about minimalism, paired with photos of symmetrical rock gardens, teacups in neutral tones, and wooden interiors that look as if Marie Kondo could live in them.

Why is this the only approach treated as synonymous with Japanese design, when we are talking about a country known at once for MUJI stores and Don Don Donki, for Zen gardens and chaotic advertising?

Read more
Total votes 7: ↑6 and ↓1 +5
Views 2.1K
Comments 1

Access the power of hardware accelerated video codecs in your Windows applications via FFmpeg / libavcodec

Intel corporate blog Working with video *Programming *
Tutorial
Since 2011, all Intel GPUs (integrated and discrete Intel Graphics products) have included Intel Quick Sync Video (QSV), a dedicated hardware core for video encoding and decoding. Intel QSV is supported by all popular video processing applications across multiple OSes, including FFmpeg. This tutorial focuses on Intel QSV-based acceleration of video encoding and decoding in native Windows (desktop) applications that use FFmpeg/libavcodec for video processing. To illustrate the concepts described, the open-source 3D Streaming Toolkit is used.
Read more →
Total votes 3: ↑3 and ↓0 +3
Views 4.5K
Comments 0

Load test of WebRTC recording on AWS

Flashphoner corporate blog System administration *Working with video *IT Infrastructure *DevOps *

Do you remember how, just a few years ago, losing a camera at the end of a vacation was a disaster? All the memorable pictures and videos disappeared along with the lost device. Perhaps this is what prompted great minds to invent cloud storage, so that the safety of recordings no longer depends on keeping the devices that made them.

Read more
Rating 0
Views 1.1K
Comments 0

WebRTC face to face video chat. Load test

Flashphoner corporate blog System administration *Working with video *IT Infrastructure *DevOps *

We continue to review variants of load tests. In this article we will go over the testing methodology and conduct a load test to estimate how many users can stream and watch at the same time, that is, publish and view streams simultaneously.

Read more
Rating 0
Views 829
Comments 0

Load testing for WebRTC mixer

Flashphoner corporate blog System administration *Working with video *IT Infrastructure *DevOps *

This article is a continuation of our series of write-ups about load tests for our server. We have already discussed how to compile metrics and how to use them to choose the equipment, and we also provided an overview of various load testing methods. Today we shall look at how the server handles stream mixing.

Read more
Total votes 1: ↑1 and ↓0 +1
Views 529
Comments 0

Using a headless browser for WebRTC load tests

Flashphoner corporate blog System administration *Working with video *IT Infrastructure *DevOps *

In the previous article we went over a load test whose data could be used to choose a load-appropriate server. In the course of the testing, we published a stream on one WCS and picked up that stream several times using a second WCS. The results could be used as a basis for decisions on server operability.

Some would (justly) have concerns about possible bias in such a test. After all, one of our servers was used to test another one of our servers. Could it be that we were using specially optimized code that skewed the results in our favor?

Read more
Total votes 1: ↑1 and ↓0 +1
Views 1.4K
Comments 0

Choosing a server for 1000 WebRTC streams

Flashphoner corporate blog System administration *Working with video *IT Infrastructure *DevOps *

In any project, a great deal of importance is placed on the selection of server hardware and WebRTC streaming is no exception. One of the key principles of such a selection is balance – the hardware should be powerful enough to handle the streams with no drops in quality, but not too powerful so as to waste resources. So, how does one choose the right server?

Read more
Total votes 3: ↑3 and ↓0 +3
Views 1K
Comments 0

Practical uses of WebRTC Canvas streaming

Flashphoner corporate blog Website development *Working with video *Programming *Video conferencing

In this article we will once again return to the tired topic of webinars and webinar hosting tools. And no, we're not about to code a whole new system for webinar hosting; there are already plenty of those. Instead, we will talk about connecting drawing software to the webinar, so that you can draw by hand and broadcast the process.

Read More
Rating 0
Views 1.7K
Comments 4

An Unexpected Article About Our Unicorn: Who Is the PVS-Studio Mascot?

PVS-Studio corporate blog Graphic design *Branding Design

PVS-Studio has a mascot that has become inseparable from the brand: a unicorn. Lately we've been getting many questions about our magic steed: why a unicorn, why has he changed so much, does he have hooves, how come he doesn't wear pants, and how do we draw him? The answers are finally here, in this very article.

Attention: there will be a lot of pictures. And I mean A LOT.

Read more
Total votes 11: ↑11 and ↓0 +11
Views 780
Comments 0

The benefits of offering VoIP to your customers under your own brand

Zadarma corporate blog API *Development of communication systems *Cloud services Voice user interfaces

The potential of VoIP for your customers is simply phenomenal. Businesses are experiencing the advantages of VoIP's cost-efficiency and reliability, and now you can pass these benefits on to your own customers very easily. Cloud telecommunication is sophisticated and easily integrated. Confidence in this technology is growing fast. There has never been a better time to start talking to your customers about adopting this solution. It will deliver huge business benefits for them and has the potential to increase business income and profitability.

Read more
Total votes 4: ↑4 and ↓0 +4
Views 1.1K
Comments 0

WebRTC screen-sharing with authorization and other benefits

Flashphoner corporate blog Website development *Working with video *Programming *Video conferencing

The popularity of online education is increasing every month. With that popularity come more competition and better quality: many founders of online schools have realized that it is no longer enough to make something mediocre using the front camera of an iPhone 4. You need to work on the quality of the picture, the sound, the presentation of the material, and so on. Not only did the pandemic prove that you can teach salsa and even crochet online, it also gave an excellent push to improve teaching materials and to diversify the labor market. Moms on maternity leave can rejoice: now you can truly earn a hundred thousand million per second, sitting at home.

Read more
Total votes 2: ↑1 and ↓1 0
Views 1.4K
Comments 1