{"id":2186,"date":"2023-12-12T19:35:00","date_gmt":"2023-12-12T19:35:00","guid":{"rendered":"https:\/\/www.darkreading.com\/cloud-security\/google-cloud-dataproc-abuse-risk-corporate-data-stores"},"modified":"2023-12-12T19:35:00","modified_gmt":"2023-12-12T19:35:00","slug":"google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores","status":"publish","type":"post","link":"https:\/\/ddi.mohflo.net\/index.php\/2023\/12\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores\/","title":{"rendered":"Google Cloud&#8217;s &#8216;Dataproc&#8217; Abuse Risk Endangers Corporate Data Stores"},"content":{"rendered":"<div class=\"media_block\"><a href=\"https:\/\/i0.wp.com\/eu-images.contentstack.com\/v3\/assets\/blt6d90778a997de1cd\/blt688134ade1090d26\/64f17de5ce538fffc6e3f541\/data_center_Antoni_Bastien_Alamy.jpg?ssl=1\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores.jpg?w=640&#038;ssl=1\" class=\"media_thumbnail\"><\/a><\/div>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">Lackluster security controls in one of Google&#8217;s cloud services for data scientists could allow hackers to create applications, execute operations, and access data in Internet-facing environments.<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">The issue lies with Google Cloud&#8217;s &#8220;Dataproc,&#8221; a managed service for running large-scale data processing and analytics workloads via Apache Hadoop, Spark, and more than 30 other open source tools and frameworks.<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">A so-called &#8220;abuse risk&#8221; to Dataproc, <\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\"><a href=\"https:\/\/orca.security\/resources\/blog\/unauthenticated-access-to-google-cloud-dataproc\" target=\"_blank\" class=\"ContentText-BodyTextChunk ContentText-BodyTextChunk_link\" rel=\"noopener\">outlined by the Orca Research Pod on Dec. 12<\/a><\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">, rests on the presence of two default open firewall ports used by Dataproc. If an attacker is able to achieve initial <\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\"><a href=\"https:\/\/www.darkreading.com\/vulnerabilities-threats\/scarleteel-hackers-worm-into-aws-cloud\" target=\"_blank\" class=\"ContentText-BodyTextChunk ContentText-BodyTextChunk_link\" rel=\"noopener\">server compromise in an exposed cloud environment<\/a><\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\"> (through a common misconfiguration, say), they could take advantage of missing security checks to reach connected resources, such as data scientists&#8217; <\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\"><a href=\"https:\/\/www.darkreading.com\/cloud-security\/rising-public-cloud-adoption-is-accelerating-shadow-data-risks\" target=\"_blank\" class=\"ContentText-BodyTextChunk ContentText-BodyTextChunk_link\" rel=\"noopener\">reams of sensitive data<\/a><\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">. They could also toy with their cloud environments in myriad other ways.<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">&#8220;One can imagine that the data used for analysis is likely to contain proprietary as well as sensitive data, which, if breached could provide bad actors with customer data, business intelligence, and other data that could be used for competitive intelligence,&#8221; says Roi Nisimi, cloud threat researcher at Orca Security.<\/span><\/p>\n<h2 class=\"ContentText ContentText_variant_h2 ContentText_align_left\" data-testid=\"content-text\"><span class=\"ContentText-BodyTextChunk ContentText-BodyTextChunk_bold\">Exposed Dataproc in Default Private Cloud<\/span><\/h2>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">Dataproc&#8217;s issues begin with the fact that its two Web interfaces used for every master node \u2014 YARN ResourceManager on port 8088 and Apache&#8217;s Hadoop Distributed File System (HDFS) NameNode on port 9870 \u2014 don&#8217;t require any authentication.<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">&#8220;The two ports mentioned above are served for all addresses,&#8221; according to Orca. &#8220;Which means to fully access them, the one single prerequisite is Internet access. So one not properly segmented cluster can cause great damage.&#8221;<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">As for the specific potential attack path, the researchers note that it&#8217;s &#8220;fairly simple.&#8221;&nbsp;<\/span><\/p>\n<div><img decoding=\"async\" data-testid=\"content-image\" data-component=\"image\" class=\"ContentImage-Image ContentImage-Image_align_left\" data-src=\"https:\/\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores.png\" src=\"https:\/\/lh7-us.googleusercontent.com\/Rw7nGNN_tCerXEJFF4P2bjbfFauH2ToKj-Phf8XHPORnccTdXqF-66oie7VWDNKJg2najgGtwQy3zPEuPsO2KU2Ftbk0jRDLfMnwDQaxOxh8aXF4UpDkKj4Yd7cS09bLv05wdcJzkEIMGOS7Oloh0nY?width=502&amp;auto=webp&amp;quality=10&amp;disable=upscale&amp;blur=40\" loading=\"lazy\" alt=\"Google Dataproc Data Leak Attack Flow\" title=\"Google Dataproc Data Leak Attack Flow\"><\/p>\n<p class=\"ContentImage-Link\">Source: Orca Security<\/p>\n<\/div>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">Google Cloud comes packaged with a default virtual private cloud (VPC) called Compute Engine, which, while limiting most inbound connections, does not limit any connections within an organization&#8217;s internal subnetwork. So, if an attacker can breach and execute code in the default VPC \u2014 say, if it&#8217;s left open to the Internet \u2014 they have a path to access Dataproc clusters because those two interfaces are left open by default.<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">&#8220;The attacker can now tunnel through the compromised machine to access both Web interfaces,&#8221; the researchers explained. &#8220;They can use the YARN endpoint to create applications, submit jobs and perform <\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\"><a href=\"https:\/\/cloud.google.com\/dataproc\/docs\/concepts\/accessing\/cluster-web-interfaces#allowed_yarn_resourcemanager_rest_apis\" target=\"_blank\" class=\"ContentText-BodyTextChunk ContentText-BodyTextChunk_link\" rel=\"noopener\">Cloud Storage operations<\/a><\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">. &#8230; Or worse, they can use the HDFS endpoint to browse through the storage file system and obtain full access to sensitive data.&#8221;<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">The upshot, as researchers explained: &#8220;Having an Internet-facing remote code execution (RCE) \u2014 vulnerable Compute Engine instance is not farfetched.&#8221;<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">The researchers brought their findings to Google, but the issue has not yet been resolved. Google also has not responded to Dark Reading&#8217;s request for comment on this story.<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">Nisimi says that Google could implement a fix rather easily. \u201cPotential solutions would prevent unauthenticated access to the cluster Web interfaces,\u201d he explains. \u201cFor example, Google could enable authentication by default in the underlying open source software (OSS) managed solution, so that GCP Dataproc only allows authenticated access.\u201d<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">Orca did acknowledge that <\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\"><a href=\"https:\/\/cloud.google.com\/dataproc\/docs\/concepts\/accessing\/cluster-web-interfaces\" target=\"_blank\" class=\"ContentText-BodyTextChunk ContentText-BodyTextChunk_link\" rel=\"noopener\">Google&#8217;s Dataproc documentation<\/a><\/span><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">&nbsp;highlights this potential security risk and suggests avoiding open firewall rules on a public network, but &#8220;they don\u2019t take into account the risk of an attacker already having an initial foothold on a Compute Engine instance \u2014 which would give them unauthenticated access to GCP Dataproc as well,&#8221; according to the Orca post.<\/span><\/p>\n<h2 class=\"ContentText ContentText_variant_h2 ContentText_align_left\" data-testid=\"content-text\"><span class=\"ContentText-BodyTextChunk ContentText-BodyTextChunk_bold\">Avoiding Cyber-Risk in Exposed Dataproc<\/span><\/h2>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">To address such possibilities, the researchers recommended that Dataproc admins practice effective vulnerability management and properly segment their networks by creating independent clusters in different subnets, without cross-contamination with other services. Admins can also adjust firewall rules, or move to other VPCs.<\/span><\/p>\n<p class=\"ContentParagraph ContentParagraph_align_left\" data-testid=\"content-paragraph\"><span class=\"ContentText ContentText_variant_bodyNormal\" data-testid=\"content-text\">Unless Google itself implements some sort of fix, the researchers wrote, &#8220;it\u2019s up to organizations themselves to ensure that their GCP Dataproc clusters are not configured in a way that makes them vulnerable.&#8221;<\/span><\/p>\n<p><a href=\"https:\/\/www.darkreading.com\/cloud-security\/google-cloud-dataproc-abuse-risk-corporate-data-stores\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Lackluster security controls in one of Google&#8217;s cloud services for<\/p>\n","protected":false},"author":12,"featured_media":2187,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1],"tags":[809],"class_list":["post-2186","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-dark-reading"],"featured_image_urls":{"full":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=2560%2C1709&ssl=1",2560,1709,false],"thumbnail":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?resize=150%2C150&ssl=1",150,150,true],"medium":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=300%2C200&ssl=1",300,200,true],"medium_large":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=640%2C428&ssl=1",640,428,true],"large":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=640%2C428&ssl=1",640,428,true],"1536x1536":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=1536%2C1025&ssl=1",1536,1025,true],"2048x2048":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=2048%2C1367&ssl=1",2048,1367,true],"chromenews-featured":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=1024%2C684&ssl=1",1024,684,true],"chromenews-large":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?resize=825%2C575&ssl=1",825,575,true],"chromenews-medium":["https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?resize=590%2C410&ssl=1",590,410,true]},"author_info":{"display_name":"Dark Reading","author_link":"https:\/\/ddi.mohflo.net\/index.php\/author\/darkreading\/"},"category_info":"<a href=\"https:\/\/ddi.mohflo.net\/index.php\/category\/uncategorized\/\" rel=\"category tag\">Uncategorized<\/a>","tag_info":"Uncategorized","comment_count":"0","jetpack_featured_media_url":"https:\/\/i0.wp.com\/ddi.mohflo.net\/wp-content\/uploads\/2023\/12\/google-clouds-dataproc-abuse-risk-endangers-corporate-data-stores-scaled.jpg?fit=2560%2C1709&ssl=1","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/posts\/2186","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/comments?post=2186"}],"version-history":[{"count":0,"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/posts\/2186\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/media\/2187"}],"wp:attachment":[{"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/media?parent=2186"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/categories?post=2186"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ddi.mohflo.net\/index.php\/wp-json\/wp\/v2\/tags?post=2186"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}