Australia

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n","post_body":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n\n

Imagine a world where anyone can communicate using sign language over video. Inspired by this vision, some of our engineering team decided to bring this idea to HealthHack 2018. In less than 48 hours and using the power of artificial intelligence, their team was able to produce a working prototype which translated signs from the Auslan alphabet to English text in real time.

$\"Coviu$

The problem

People who are hearing impaired are left behind in video consultations. Our customers tell us that, because they can’t sign themselves, they have to use basic text chat to hold their consults with hearing-impaired patients - a less than ideal solution. With the growing adoption of telehealth, deaf people need to be able to communicate naturally with their healthcare network, regardless of whether the practitioner knows sign language.

Achieving universal sign language translation is no easy feat. The dynamic nature of natural sign language makes it a hard task for computers, not to mention the fact that there are over 200 dialects of sign language worldwide. Speakers of American Sign Language (ASL) have been fortunate in that a number of startups and research projects are dedicated to translating ASL in real time.

In Australia however, where Auslan is the national sign language, speakers have not been so fortunate, and there is next to no work being done for the Auslan community. We thought we might be able to help.

Our solution

Our goal was a lofty one - create a web application that uses a computer’s webcam to capture a person signing the Auslan alphabet, and translate it in real time. This would involve:

Gathering data
Training a machine learning model to recognise the Auslan alphabet
Building the user interface

Building the Auslan Alphabet Image Dataset

Machine learning, a branch of artificial intelligence, is the practice of teaching computers how to learn. In general, we do this by giving computers a bunch of examples of “labelled” data - e.g. here is an image, and it is a dog - and tell the computer to find similarities about objects of the same label; a process called “supervised learning”.

So to train a machine learning model that could recognise the Auslan alphabet, we needed a bunch of images of people signing each letter in the Auslan alphabet, coupled with what English letter each photo represented. Our model also needs to learn where the hands are in the image, and for that, we need to draw bounding boxes around the hands in the image.

$\"dataset$

Example of a manually-drawn bounding box for the letter \"B\"

This kind of data exists for ASL, but as it turns out, there is no dataset of images of the Auslan alphabet.

So, we made one. We downloaded several videos from YouTube of people demonstrating the Auslan alphabet, extracted every frame from each video, and then manually drew bounding boxes over the hands to mark the letter, one frame at a time - It was exactly as much fun as it sounds! Machine learning models like diverse input data, so we also captured around 700 images of each team member signing the letters at different angles and in different lighting conditions.

Proof of concept

Now that we had some data, we wanted to see if we could actually get something working. Enter YOLO (You Only Look Once) - a popular algorithm for object detection. Clearly not his first rodeo, our team member Michael Swan (Tabcorp) was able to produce a proof-of-concept video that recognised the letters of \"HEALTH HACK\" after only a few hours of training.

Excuse the poor signing - Tom only learned the signs 10 minutes prior!

[youtube https://www.youtube.com/watch?v=DDGplO5jB4M?rel=0&showinfo=0&w=560&h=315]

Training a machine learning model

A controlled example with limited classes is great, but we wanted to see how far we could get classifying all 26 signs of the Auslan alphabet. For this, we trained our own neural network. Inspired by the mammalian brain, neural networks, in particular, convolutional neural networks, have proven to be very effective at image classification, and our problem was no exception.

Lex Toumbourou (ThoughtWorks), the team’s resident machine learning expert, trained a convolutional neural network using Pytorch (a Python machine learning framework) to predict where the hands were (each point of the bounding box), as well as the class (the letter). A few tweaks here and there and we had a model that could predict signs from the Auslan alphabet with ~86% accuracy. See Lex's research and implementation.

$\"machine$

Our machine learning model at work

The Final Push

Remember our end goal - translate the Auslan alphabet in real time. Building the first image dataset for the Auslan alphabet and having a model predicting with ~86% accuracy would have been a solid achievement for a hackathon on any given day.

But we wanted to push it even further. We wanted a demo-able application.

The final 2 components we needed were; a backend service that, given an image of a sign, would return us the predicted letter, and; a front-end that could capture and display video from the user’s webcam and ask the backend for predictions.

For the backend, we wrapped our model up in a Flask app (Python) - a POST request with the image as the payload would return us the 4 points of the bounding box and the class (or letter) of the image. On the client side, we used plain ol' Javascript to capture the users’ webcam with the browser’s getUserMedia method and using an invisible canvas we took a frame from the video every 200ms, requested a prediction, and displayed the results accordingly.

$\"HealthHack$

For those interested in the technical details, all of our code is open source on Github for everyone to use and improve.

We were able to build the first Auslan alphabet image dataset, train a machine learning model from scratch, AND make a Python web app which could translate sign language in real time- in one weekend!

Our team

We couldn’t have done it without our amazing team. We thank Lex Toumbourou (Machine Learning Engineer) and Michael Swan (Game developer) for coming together with us and building something incredible over the weekend.

Where to from here

We are super proud of what we achieved over the weekend, but it’s only just the beginning. With a bigger dataset and more tweaking of our models, we believe we could develop accurate and reliable technology for fingerspelling using Auslan.

Of course, sign language involves more than just hands and letters; it incorporates facial expressions and sequences of gestures to form full sentences. While a solution to natural sign language translation is still an open problem, we believe we pushed the needle ever so slightly towards a better experience for the hearing-impaired.

We’d love to learn more about how we can help bring telehealth to the hearing-impaired. If you’re interested in learning more, please reach out us at support@coviu.com

","rss_summary":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n","rss_body":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n\n

$\"Coviu$

The problem

Our solution

Our goal was a lofty one - create a web application that uses a computer’s webcam to capture a person signing the Auslan alphabet, and translate it in real time. This would involve:

Gathering data
Training a machine learning model to recognise the Auslan alphabet
Building the user interface

Building the Auslan Alphabet Image Dataset

$\"dataset$

Example of a manually-drawn bounding box for the letter \"B\"

This kind of data exists for ASL, but as it turns out, there is no dataset of images of the Auslan alphabet.

Proof of concept

Excuse the poor signing - Tom only learned the signs 10 minutes prior!

[youtube https://www.youtube.com/watch?v=DDGplO5jB4M?rel=0&showinfo=0&w=560&h=315]

Training a machine learning model

$\"machine$

Our machine learning model at work

The Final Push

But we wanted to push it even further. We wanted a demo-able application.

$\"HealthHack$

For those interested in the technical details, all of our code is open source on Github for everyone to use and improve.

We were able to build the first Auslan alphabet image dataset, train a machine learning model from scratch, AND make a Python web app which could translate sign language in real time- in one weekend!

Our team

Where to from here

We’d love to learn more about how we can help bring telehealth to the hearing-impaired. If you’re interested in learning more, please reach out us at support@coviu.com

","enable_google_amp_output_override":false,"generate_json_ld_enabled":true,"blog_post_schedule_task_uid":null,"blog_publish_to_social_media_task":"DONE","blog_publish_instant_email_task_uid":"DONE","blog_publish_instant_email_campaign_id":null,"blog_publish_instant_email_retry_count":null,"keywords":[],"composition_id":0,"is_crawlable_by_bots":false,"layout_sections":{},"past_mab_experiment_ids":[],"deleted_by":null,"featured_image_alt_text":"","enable_layout_stylesheets":null,"tweet":null,"tweet_at":null,"campaign_name":null,"campaign_utm":null,"meta_keywords":null,"meta_description":"We used AI to translate sign language in real-time. See how we used Python to train a neural network with 86% accuracy in less than a day.","tweet_immediately":false,"publish_immediately":false,"security_state":"NONE","placement_guids":[],"header_template_path":null,"header_variant_name":null,"footer_template_path":null,"footer_variant_name":null,"global_block_overrides":{},"property_for_dynamic_page_title":null,"property_for_dynamic_page_slug":null,"property_for_dynamic_page_meta_description":null,"property_for_dynamic_page_featured_image":null,"property_for_dynamic_page_canonical_url":null,"preview_image_src":null,"legacy_blog_tabid":null,"legacy_post_guid":null,"performable_variation_letter":null,"style_override_id":null,"has_user_changes":true,"css":{},"css_text":"","unpublished_at":0,"published_by_id":10287886,"allowed_slug_conflict":false,"ai_features":null,"link_rel_canonical_url":null,"page_redirected":false,"page_expiry_enabled":null,"page_expiry_date":null,"page_expiry_redirect_id":null,"page_expiry_redirect_url":null,"deleted_by_id":null,"state_when_deleted":null,"cloned_from":null,"staged_from":null,"personas":[],"compose_body":null,"featured_image":"https://f.hubspotusercontent30.net/hubfs/4554639/Imported_Blog_Media/franck-v-740564-unsplash.jpg","featured_image_width":4606,"featured_image_height":3588,"publish_timezone_offset":null,"theme_settings_values":null,"published_at":1597191697222,"head_html":null,"footer_html":null,"attached_stylesheets":[],"enable_domain_stylesheets":null,"include_default_custom_css":null,"header":null,"password":null,"last_edit_session_id":null,"last_edit_update_id":null,"created_by_agent":null},"metaDescription":"We used AI to translate sign language in real-time. See how we used Python to train a neural network with 86% accuracy in less than a day.","metaKeywords":null,"name":"How we used AI to translate sign language in real time","nextPostFeaturedImage":"https://f.hubspotusercontent30.net/hubfs/4554639/Imported_Blog_Media/rawpixel-800772-unsplash.jpg","nextPostFeaturedImageAltText":"","nextPostName":"Coviu appoints Dr Amandeep Hansra to its Board of Directors","nextPostSlug":"en-au/resources/blog/2018/10/04/coviu-appoints-dr-amandeep-hansra-to-its-board-of-directors","pageExpiryDate":null,"pageExpiryEnabled":null,"pageExpiryRedirectId":null,"pageExpiryRedirectUrl":null,"pageRedirected":false,"pageTitle":"How we used AI to translate sign language in real time","parentBlog":{"absoluteUrl":"https://www.coviu.com/en-au/resources/blog","allowComments":false,"ampBodyColor":"#404040","ampBodyFont":"'Helvetica Neue', Helvetica, Arial, sans-serif","ampBodyFontSize":"18","ampCustomCss":"","ampHeaderBackgroundColor":"#ffffff","ampHeaderColor":"#1e1e1e","ampHeaderFont":"'Helvetica Neue', Helvetica, Arial, sans-serif","ampHeaderFontSize":"36","ampLinkColor":"#416bb3","ampLogoAlt":"","ampLogoHeight":0,"ampLogoSrc":"","ampLogoWidth":0,"analyticsPageId":5956374167,"attachedStylesheets":[],"audienceAccess":"PUBLIC","businessUnitId":null,"captchaAfterDays":7,"captchaAlways":false,"categoryId":3,"cdnPurgeEmbargoTime":null,"closeCommentsOlder":0,"commentDateFormat":"medium","commentFormGuid":"8724b334-47fd-4f90-9de6-31513d349a14","commentMaxThreadDepth":1,"commentModeration":false,"commentNotificationEmails":[],"commentShouldCreateContact":false,"commentVerificationText":"","cosObjectType":"BLOG","created":1531355843147,"createdDateTime":1531355843147,"dailyNotificationEmailId":null,"dateFormattingLanguage":null,"defaultGroupStyleId":"","defaultNotificationFromName":"Sample Author","defaultNotificationReplyTo":"SampleAuthor@hubspot.com","deletedAt":0,"description":"Discover the latest. Industry news, case studies, product updates and more.","domain":"","domainWhenPublished":"www.coviu.com","emailApiSubscriptionId":5580112,"enableGoogleAmpOutput":true,"enableSocialAutoPublishing":true,"generateJsonLdEnabled":false,"header":null,"htmlFooter":"","htmlFooterIsShared":true,"htmlHead":"\n \n","htmlHeadIsShared":true,"htmlKeywords":[],"htmlTitle":"Telehealth Blog, News & Updates | Coviu","id":5956374167,"ilsSubscriptionListsByType":{"instant":1272},"instantNotificationEmailId":"6617311775","itemLayoutId":null,"itemTemplateIsShared":false,"itemTemplatePath":"Act3 child/templates/blog-post.html","label":"Coviu Blog","language":"en-au","legacyGuid":null,"legacyModuleId":null,"legacyTabId":null,"listingLayoutId":null,"listingPageId":84629824301,"listingTemplatePath":"coviu2021/templates/blog-index.html","liveDomain":"www.coviu.com","monthFilterFormat":"MMMM yyyy","monthlyNotificationEmailId":"6619882901","name":"Coviu Blog","parentBlogUpdateTaskId":null,"portalId":4554639,"postHtmlFooter":"","postHtmlHead":"\n \n","postsPerListingPage":12,"postsPerRssFeed":10,"publicAccessRules":[],"publicAccessRulesEnabled":false,"publicTitle":"Coviu Blog","publishDateFormat":"medium","resolvedDomain":"www.coviu.com","rootUrl":"https://www.coviu.com/en-au/resources/blog","rssCustomFeed":null,"rssDescription":null,"rssItemFooter":null,"rssItemHeader":null,"settingsOverrides":{"itemLayoutId":false,"itemTemplatePath":false,"itemTemplateIsShared":false,"listingLayoutId":false,"listingTemplatePath":false,"postsPerListingPage":false,"showSummaryInListing":false,"useFeaturedImageInSummary":false,"htmlHead":false,"postHtmlHead":false,"htmlHeadIsShared":false,"htmlFooter":false,"listingPageHtmlFooter":false,"postHtmlFooter":false,"htmlFooterIsShared":false,"attachedStylesheets":false,"postsPerRssFeed":false,"showSummaryInRss":false,"showSummaryInEmails":false,"showSummariesInEmails":false,"allowComments":false,"commentShouldCreateContact":false,"commentModeration":false,"closeCommentsOlder":false,"commentNotificationEmails":false,"commentMaxThreadDepth":false,"commentVerificationText":false,"socialAccountTwitter":false,"showSocialLinkTwitter":false,"showSocialLinkLinkedin":false,"showSocialLinkFacebook":false,"enableGoogleAmpOutput":false,"ampLogoSrc":false,"ampLogoHeight":false,"ampLogoWidth":false,"ampLogoAlt":false,"ampHeaderFont":false,"ampHeaderFontSize":false,"ampHeaderColor":false,"ampHeaderBackgroundColor":false,"ampBodyFont":false,"ampBodyFontSize":false,"ampBodyColor":false,"ampLinkColor":false,"generateJsonLdEnabled":false},"showSocialLinkFacebook":true,"showSocialLinkLinkedin":true,"showSocialLinkTwitter":true,"showSummaryInEmails":true,"showSummaryInListing":true,"showSummaryInRss":true,"siteId":null,"slug":"en-au/resources/blog","socialAccountTwitter":"","state":null,"subscriptionContactsProperty":"blog_coviu_blog_5956374167_subscription","subscriptionEmailType":null,"subscriptionFormGuid":"8642d43f-425a-4c2c-838c-9075b6c9b982","subscriptionListsByType":{"instant":1555},"title":null,"translatedFromId":null,"translations":{"en-us":{"absoluteUrl":"https://www.coviu.com/en-us/resources/blog","id":37888912834,"language":"en-us","masterId":5956374167,"name":"Coviu Blog","publicAccessRules":[],"publicAccessRulesEnabled":false,"slug":"en-us/resources/blog"}},"updated":1772674760666,"updatedDateTime":1772674760666,"urlBase":"www.coviu.com/en-au/resources/blog","urlSegments":{"all":"all","archive":"archive","author":"author","page":"page","tag":"tag"},"useFeaturedImageInSummary":true,"usesDefaultTemplate":false,"weeklyNotificationEmailId":"6619888455"},"password":null,"pastMabExperimentIds":[],"performableGuid":null,"performableVariationLetter":null,"personalizationStrategyId":null,"personalizationVariantStatus":null,"personas":[],"placementGuids":[],"portableKey":null,"portalId":4554639,"position":null,"postBody":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n\n

$\"Coviu$

The problem

Our solution

Our goal was a lofty one - create a web application that uses a computer’s webcam to capture a person signing the Auslan alphabet, and translate it in real time. This would involve:

Gathering data
Training a machine learning model to recognise the Auslan alphabet
Building the user interface

Building the Auslan Alphabet Image Dataset

$\"dataset$

Example of a manually-drawn bounding box for the letter \"B\"

This kind of data exists for ASL, but as it turns out, there is no dataset of images of the Auslan alphabet.

Proof of concept

Excuse the poor signing - Tom only learned the signs 10 minutes prior!

[youtube https://www.youtube.com/watch?v=DDGplO5jB4M?rel=0&showinfo=0&w=560&h=315]

Training a machine learning model

$\"machine$

Our machine learning model at work

The Final Push

But we wanted to push it even further. We wanted a demo-able application.

$\"HealthHack$

For those interested in the technical details, all of our code is open source on Github for everyone to use and improve.

We were able to build the first Auslan alphabet image dataset, train a machine learning model from scratch, AND make a Python web app which could translate sign language in real time- in one weekend!

Our team

Where to from here

We’d love to learn more about how we can help bring telehealth to the hearing-impaired. If you’re interested in learning more, please reach out us at support@coviu.com

","postBodyRss":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n\n

$\"Coviu$

The problem

Our solution

Our goal was a lofty one - create a web application that uses a computer’s webcam to capture a person signing the Auslan alphabet, and translate it in real time. This would involve:

Gathering data
Training a machine learning model to recognise the Auslan alphabet
Building the user interface

Building the Auslan Alphabet Image Dataset

$\"dataset$

Example of a manually-drawn bounding box for the letter \"B\"

This kind of data exists for ASL, but as it turns out, there is no dataset of images of the Auslan alphabet.

Proof of concept

Excuse the poor signing - Tom only learned the signs 10 minutes prior!

[youtube https://www.youtube.com/watch?v=DDGplO5jB4M?rel=0&showinfo=0&w=560&h=315]

Training a machine learning model

$\"machine$

Our machine learning model at work

The Final Push

But we wanted to push it even further. We wanted a demo-able application.

$\"HealthHack$

For those interested in the technical details, all of our code is open source on Github for everyone to use and improve.

We were able to build the first Auslan alphabet image dataset, train a machine learning model from scratch, AND make a Python web app which could translate sign language in real time- in one weekend!

Our team

Where to from here

We’d love to learn more about how we can help bring telehealth to the hearing-impaired. If you’re interested in learning more, please reach out us at support@coviu.com

","postEmailContent":"

\n Tom Quirk \n
\n Software engineer\n

\n Kaamraan Kamaal \n
\n Software engineer\n

","postFeaturedImageIfEnabled":"https://f.hubspotusercontent30.net/hubfs/4554639/Imported_Blog_Media/franck-v-740564-unsplash.jpg","postListContent":"

\n Tom Quirk \n
\n Software engineer\n

\n Kaamraan Kamaal \n
\n Software engineer\n

","postListSummaryFeaturedImage":"https://f.hubspotusercontent30.net/hubfs/4554639/Imported_Blog_Media/franck-v-740564-unsplash.jpg","postRssContent":"

\n Tom Quirk \n
\n Software engineer\n

\n Kaamraan Kamaal \n
\n Software engineer\n

","postRssSummaryFeaturedImage":"https://f.hubspotusercontent30.net/hubfs/4554639/Imported_Blog_Media/franck-v-740564-unsplash.jpg","postSummary":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n","postSummaryRss":"

\n Tom Quirk \n
\n Software engineer\n

\n Kaamraan Kamaal \n
\n Software engineer\n

","postTemplate":"Act3 child/templates/blog-post.html","previewImageSrc":null,"previewKey":"JcXgLmsy","previousPostFeaturedImage":"https://f.hubspotusercontent30.net/hubfs/4554639/Imported_Blog_Media/e7f54-1necelwyngjdn7sr-56lamg-1.jpeg","previousPostFeaturedImageAltText":"","previousPostName":"11 reasons why Coviu is right for your practice:","previousPostSlug":"en-au/resources/blog/2018/09/20/11-reasons-why-coviu-is-right-for-your-practice","processingStatus":"PUBLISHED","propertyForDynamicPageCanonicalUrl":null,"propertyForDynamicPageFeaturedImage":null,"propertyForDynamicPageMetaDescription":null,"propertyForDynamicPageSlug":null,"propertyForDynamicPageTitle":null,"publicAccessRules":[],"publicAccessRulesEnabled":false,"publishDate":1537490722000,"publishDateLocalTime":1537490722000,"publishDateLocalized":{"date":1537490722000,"format":"medium","language":null},"publishImmediately":false,"publishTimezoneOffset":null,"publishedAt":1597191697222,"publishedByEmail":null,"publishedById":10287886,"publishedByName":null,"publishedUrl":"https://www.coviu.com/en-au/resources/blog/2018/09/21/how-we-used-ai-to-translate-sign-language-in-real-time","resolvedDomain":"www.coviu.com","resolvedLanguage":null,"rssBody":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n\n

$\"Coviu$

The problem

Our solution

Our goal was a lofty one - create a web application that uses a computer’s webcam to capture a person signing the Auslan alphabet, and translate it in real time. This would involve:

Gathering data
Training a machine learning model to recognise the Auslan alphabet
Building the user interface

Building the Auslan Alphabet Image Dataset

$\"dataset$

Example of a manually-drawn bounding box for the letter \"B\"

This kind of data exists for ASL, but as it turns out, there is no dataset of images of the Auslan alphabet.

Proof of concept

Excuse the poor signing - Tom only learned the signs 10 minutes prior!

[youtube https://www.youtube.com/watch?v=DDGplO5jB4M?rel=0&showinfo=0&w=560&h=315]

Training a machine learning model

$\"machine$

Our machine learning model at work

The Final Push

But we wanted to push it even further. We wanted a demo-able application.

$\"HealthHack$

For those interested in the technical details, all of our code is open source on Github for everyone to use and improve.

We were able to build the first Auslan alphabet image dataset, train a machine learning model from scratch, AND make a Python web app which could translate sign language in real time- in one weekend!

Our team

Where to from here

We’d love to learn more about how we can help bring telehealth to the hearing-impaired. If you’re interested in learning more, please reach out us at support@coviu.com

","rssSummary":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n","rssSummaryFeaturedImage":"https://f.hubspotusercontent30.net/hubfs/4554639/Imported_Blog_Media/franck-v-740564-unsplash.jpg","scheduledUpdateDate":null,"screenshotPreviewTakenAt":1597191697336,"screenshotPreviewUrl":"https://cdn2.hubspot.net/hubshot/20/08/12/1e7a75fb-3f5b-49d2-a2c9-85e0e11f10aa.png","sections":{},"securityState":"NONE","siteId":null,"slug":"en-au/resources/blog/2018/09/21/how-we-used-ai-to-translate-sign-language-in-real-time","stagedFrom":null,"state":"PUBLISHED","stateWhenDeleted":null,"structuredContentPageType":null,"structuredContentType":null,"styleOverrideId":null,"subcategory":"imported_blog_post","syncedWithBlogRoot":true,"tagIds":[31249334600,31249336116,31249336127,31249336129,31249336140,31249336144,31249336149,31249336174,31249336189,31249336190,31249336192,31249336197,31249336225],"tagList":[{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045461212,"deletedAt":0,"description":"","id":31249334600,"label":"Machine Learning","language":null,"name":"Machine Learning","portalId":4554639,"slug":"machine-learning","translatedFromId":null,"translations":{},"updated":1593045461212},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462274,"deletedAt":0,"description":"","id":31249336116,"label":"Accessibility","language":null,"name":"Accessibility","portalId":4554639,"slug":"accessibility","translatedFromId":null,"translations":{},"updated":1593045462274},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462512,"deletedAt":0,"description":"","id":31249336127,"label":"Python","language":null,"name":"Python","portalId":4554639,"slug":"python","translatedFromId":null,"translations":{},"updated":1593045462512},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462548,"deletedAt":0,"description":"","id":31249336129,"label":"awards","language":null,"name":"awards","portalId":4554639,"slug":"awards","translatedFromId":null,"translations":{},"updated":1593045462548},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462742,"deletedAt":0,"description":"","id":31249336140,"label":"JavaScript","language":null,"name":"JavaScript","portalId":4554639,"slug":"javascript","translatedFromId":null,"translations":{},"updated":1593045462742},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462843,"deletedAt":0,"description":"","id":31249336144,"label":"Online consultations","language":null,"name":"Online consultations","portalId":4554639,"slug":"online-consultations","translatedFromId":null,"translations":{},"updated":1593045462843},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462949,"deletedAt":0,"description":"","id":31249336149,"label":"Software","language":null,"name":"Software","portalId":4554639,"slug":"software","translatedFromId":null,"translations":{},"updated":1593045462949},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463400,"deletedAt":0,"description":"","id":31249336174,"label":"Design","language":null,"name":"Design","portalId":4554639,"slug":"design","translatedFromId":null,"translations":{},"updated":1593045463400},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463672,"deletedAt":0,"description":"","id":31249336189,"label":"Auslan","language":null,"name":"Auslan","portalId":4554639,"slug":"auslan","translatedFromId":null,"translations":{},"updated":1593045463672},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463689,"deletedAt":0,"description":"","id":31249336190,"label":"Healthcare","language":null,"name":"Healthcare","portalId":4554639,"slug":"healthcare","translatedFromId":null,"translations":{},"updated":1593045463689},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463723,"deletedAt":0,"description":"","id":31249336192,"label":"Artificial Intelligence","language":null,"name":"Artificial Intelligence","portalId":4554639,"slug":"artificial-intelligence","translatedFromId":null,"translations":{},"updated":1593045463723},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463814,"deletedAt":0,"description":"","id":31249336197,"label":"Technology","language":null,"name":"Technology","portalId":4554639,"slug":"technology","translatedFromId":null,"translations":{},"updated":1593045463814},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045464332,"deletedAt":0,"description":"","id":31249336225,"label":"Tech","language":null,"name":"Tech","portalId":4554639,"slug":"tech","translatedFromId":null,"translations":{},"updated":1593045464332}],"tagNames":["Machine Learning","Accessibility","Python","awards","JavaScript","Online consultations","Software","Design","Auslan","Healthcare","Artificial Intelligence","Technology","Tech"],"teamPerms":[],"templatePath":"","templatePathForRender":"Act3 child/templates/blog-post.html","textToAudioFileId":null,"textToAudioGenerationRequestId":null,"themePath":null,"themeSettingsValues":null,"title":"How we used AI to translate sign language in real time","tmsId":null,"topicIds":[31249334600,31249336116,31249336127,31249336129,31249336140,31249336144,31249336149,31249336174,31249336189,31249336190,31249336192,31249336197,31249336225],"topicList":[{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045461212,"deletedAt":0,"description":"","id":31249334600,"label":"Machine Learning","language":null,"name":"Machine Learning","portalId":4554639,"slug":"machine-learning","translatedFromId":null,"translations":{},"updated":1593045461212},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462274,"deletedAt":0,"description":"","id":31249336116,"label":"Accessibility","language":null,"name":"Accessibility","portalId":4554639,"slug":"accessibility","translatedFromId":null,"translations":{},"updated":1593045462274},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462512,"deletedAt":0,"description":"","id":31249336127,"label":"Python","language":null,"name":"Python","portalId":4554639,"slug":"python","translatedFromId":null,"translations":{},"updated":1593045462512},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462548,"deletedAt":0,"description":"","id":31249336129,"label":"awards","language":null,"name":"awards","portalId":4554639,"slug":"awards","translatedFromId":null,"translations":{},"updated":1593045462548},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462742,"deletedAt":0,"description":"","id":31249336140,"label":"JavaScript","language":null,"name":"JavaScript","portalId":4554639,"slug":"javascript","translatedFromId":null,"translations":{},"updated":1593045462742},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462843,"deletedAt":0,"description":"","id":31249336144,"label":"Online consultations","language":null,"name":"Online consultations","portalId":4554639,"slug":"online-consultations","translatedFromId":null,"translations":{},"updated":1593045462843},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045462949,"deletedAt":0,"description":"","id":31249336149,"label":"Software","language":null,"name":"Software","portalId":4554639,"slug":"software","translatedFromId":null,"translations":{},"updated":1593045462949},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463400,"deletedAt":0,"description":"","id":31249336174,"label":"Design","language":null,"name":"Design","portalId":4554639,"slug":"design","translatedFromId":null,"translations":{},"updated":1593045463400},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463672,"deletedAt":0,"description":"","id":31249336189,"label":"Auslan","language":null,"name":"Auslan","portalId":4554639,"slug":"auslan","translatedFromId":null,"translations":{},"updated":1593045463672},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463689,"deletedAt":0,"description":"","id":31249336190,"label":"Healthcare","language":null,"name":"Healthcare","portalId":4554639,"slug":"healthcare","translatedFromId":null,"translations":{},"updated":1593045463689},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463723,"deletedAt":0,"description":"","id":31249336192,"label":"Artificial Intelligence","language":null,"name":"Artificial Intelligence","portalId":4554639,"slug":"artificial-intelligence","translatedFromId":null,"translations":{},"updated":1593045463723},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045463814,"deletedAt":0,"description":"","id":31249336197,"label":"Technology","language":null,"name":"Technology","portalId":4554639,"slug":"technology","translatedFromId":null,"translations":{},"updated":1593045463814},{"categoryId":3,"cdnPurgeEmbargoTime":null,"contentIds":[],"cosObjectType":"TAG","created":1593045464332,"deletedAt":0,"description":"","id":31249336225,"label":"Tech","language":null,"name":"Tech","portalId":4554639,"slug":"tech","translatedFromId":null,"translations":{},"updated":1593045464332}],"topicNames":["Machine Learning","Accessibility","Python","awards","JavaScript","Online consultations","Software","Design","Auslan","Healthcare","Artificial Intelligence","Technology","Tech"],"topics":[31249334600,31249336116,31249336127,31249336129,31249336140,31249336144,31249336149,31249336174,31249336189,31249336190,31249336192,31249336197,31249336225],"translatedContent":{},"translatedFromId":null,"translations":{},"tweet":null,"tweetAt":null,"tweetImmediately":false,"unpublishedAt":0,"updated":1597191697229,"updatedById":10287886,"upsizeFeaturedImage":false,"url":"https://www.coviu.com/en-au/resources/blog/2018/09/21/how-we-used-ai-to-translate-sign-language-in-real-time","useFeaturedImage":true,"userPerms":[],"views":0,"visibleToAll":null,"widgetContainers":{},"widgetcontainers":{},"widgets":{"blog_comments":{"body":{"definition_id":null,"extra_classes":"widget-type-blog_comments","field_types":{},"module_id":1366601,"path":"@hubspot/blog_comments","smart_objects":[],"smart_type":"NOT_SMART","tag":"module","type":"module","wrap_field_tag":"div"},"child_css":{},"css":{},"id":"blog_comments","label":null,"module_id":1366601,"name":"blog_comments","order":7,"smart_type":null,"styles":{},"type":"module"},"blog_post_banner":{"body":{"definition_id":null,"field_types":{"button_icon":"icon","button_text":"text","style":"group"},"module_id":23668926620,"path":"../modules/blog-post-banner","smart_objects":[],"smart_type":"NOT_SMART","tag":"module","type":"module","wrap_field_tag":"div"},"child_css":{},"css":{},"id":"blog_post_banner","label":null,"module_id":23668926620,"name":"blog_post_banner","order":4,"smart_type":null,"styles":{},"type":"module"},"blog_related_posts":{"body":{"definition_id":null,"field_types":{},"module_id":23668820189,"path":"../modules/blog-related-posts","smart_objects":[],"smart_type":"NOT_SMART","tag":"module","type":"module","wrap_field_tag":"div"},"child_css":{},"css":{},"id":"blog_related_posts","label":null,"module_id":23668820189,"name":"blog_related_posts","order":9,"smart_type":null,"styles":{},"type":"module"},"name":{"body":{"title":"How we used AI to translate sign language in real time"},"id":"name","label":"Title","name":"name","type":"text"},"post_body":{"body":{"html":"

$\"Tom$

Tom Quirk
Software engineer

$\"Kaam,$

Kaamraan Kamaal
Software engineer

\n\n

$\"Coviu$

The problem

Our solution

Our goal was a lofty one - create a web application that uses a computer’s webcam to capture a person signing the Auslan alphabet, and translate it in real time. This would involve:

Gathering data
Training a machine learning model to recognise the Auslan alphabet
Building the user interface

Building the Auslan Alphabet Image Dataset

$\"dataset$

Example of a manually-drawn bounding box for the letter \"B\"

This kind of data exists for ASL, but as it turns out, there is no dataset of images of the Auslan alphabet.

Proof of concept

Excuse the poor signing - Tom only learned the signs 10 minutes prior!

[youtube https://www.youtube.com/watch?v=DDGplO5jB4M?rel=0&showinfo=0&w=560&h=315]

Training a machine learning model

$\"machine$

Our machine learning model at work

The Final Push

But we wanted to push it even further. We wanted a demo-able application.

$\"HealthHack$

For those interested in the technical details, all of our code is open source on Github for everyone to use and improve.

We were able to build the first Auslan alphabet image dataset, train a machine learning model from scratch, AND make a Python web app which could translate sign language in real time- in one weekend!

Our team

Where to from here

We’d love to learn more about how we can help bring telehealth to the hearing-impaired. If you’re interested in learning more, please reach out us at support@coviu.com

"},"id":"post_body","label":"Blog Content","name":"post_body","type":"rich_text"}}}

Contact

Free Trial Book a Demo

Contact

Free Trial Book a Demo Sign In

Book your free 15-minute telehealth efficiency review

How we used AI to translate sign language in real time

<span id="hs_cos_wrapper_name" class="hs_cos_wrapper hs_cos_wrapper_meta_field hs_cos_wrapper_type_text" style="" data-hs-cos-general-type="meta_field" data-hs-cos-type="text" >How we used AI to translate sign language in real time</span>

Tom Quirk
Software engineer

Kaamraan Kamaal
Software engineer

Coviu at HealthHack 2018

The problem

Our solution

Our goal was a lofty one - create a web application that uses a computer’s webcam to capture a person signing the Auslan alphabet, and translate it in real time. This would involve:

Gathering data
Training a machine learning model to recognise the Auslan alphabet
Building the user interface

Building the Auslan Alphabet Image Dataset

Example of a manually-drawn bounding box for the letter "B"

This kind of data exists for ASL, but as it turns out, there is no dataset of images of the Auslan alphabet.

Proof of concept

Excuse the poor signing - Tom only learned the signs 10 minutes prior!

[youtube https://www.youtube.com/watch?v=DDGplO5jB4M?rel=0&showinfo=0&w=560&h=315]

Training a machine learning model

Our machine learning model at work

The Final Push

But we wanted to push it even further. We wanted a demo-able application.

HealthHack 2018- Coviu's team

For those interested in the technical details, all of our code is open source on Github for everyone to use and improve.

We were able to build the first Auslan alphabet image dataset, train a machine learning model from scratch, AND make a Python web app which could translate sign language in real time- in one weekend!

Our team

Where to from here

We’d love to learn more about how we can help bring telehealth to the hearing-impaired. If you’re interested in learning more, please reach out us at support@coviu.com

← 11 reasons why Coviu is right for your practice:

Coviu appoints Dr Amandeep Hansra to its Board of Directors →

The problem

Our solution

Building the Auslan Alphabet Image Dataset

Proof of concept

Training a machine learning model

The Final Push

Our team

Where to from here

The problem

Our solution

Building the Auslan Alphabet Image Dataset

Proof of concept

Training a machine learning model

The Final Push

Our team

Where to from here

How we used AI to translate sign language in real time

The problem

Our solution

Building the Auslan Alphabet Image Dataset

Proof of concept

Training a machine learning model

The Final Push

Our team

Where to from here

You May Also Like

Virtual Care for Elderly in the Community with Blue Care

Telehealth Tools Thursday - Pearson BDI-2, BAI and BHS

Monthly Round-Up - September Updates!