Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
52 changes: 52 additions & 0 deletions AGENTS.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,52 @@
<!--
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
this work for additional information regarding copyright ownership.
The ASF licenses this file to you under the Apache License, Version 2.0
(the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->

# AGENTS.md

## Security

You are helping a security researcher find and report vulnerabilities in
Apache Hive. Before drafting any report or reaching any conclusion, complete
these steps.

### Step 1 — Read the threat model
Read **[THREAT_MODEL.md](THREAT_MODEL.md)**: the trust boundaries (the
HiveServer2 SQL front door, the Metastore, the query/UDF execution layer), the
adversaries in and out of scope, and what Hive upholds versus what it leaves to
the operator.

### Step 2 — Read the security policy
Read **[SECURITY.md](SECURITY.md)** for how to report.

### Key scoping facts (see THREAT_MODEL.md)
- The **HiveServer2** SQL front door is the primary untrusted boundary; the
Metastore and execution cluster are assumed to run inside an
operator-controlled perimeter.
- **UDFs, SerDes, custom InputFormats, and `TRANSFORM` scripts are
code-execution by design**, not a sandbox — running authorized code is a
feature, not a vulnerability.
- Transport security (TLS), the choice of authorization model (Ranger /
SQL-standard / storage-based), and network isolation are **operator**
responsibilities, not engine invariants.
- Hive does **not** defend against an operator with `root`, the Hadoop
superuser, or direct HDFS / metastore-DB access.

### Step 3 — Route the finding
Route the finding to exactly one disposition in **THREAT_MODEL.md §13**
(VALID, or one of the `OUT-OF-MODEL` / `BY-DESIGN` dispositions) and cite the
section that justifies the call. This model is **v0** — open questions for the
PMC are in §14.
40 changes: 40 additions & 0 deletions SECURITY.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
<!--
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
this work for additional information regarding copyright ownership.
The ASF licenses this file to you under the Apache License, Version 2.0
(the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->

# Security Policy

## Reporting a Vulnerability

Please report suspected security vulnerabilities in Apache Hive **privately**
to the Hive security list at `security@hive.apache.org`, following the
[Apache Software Foundation security process](https://www.apache.org/security/).
Do **not** open public GitHub issues or pull requests for security reports — a
private report lets the issue be investigated and fixed before disclosure.

## Threat Model

A threat model for Apache Hive is maintained in
[THREAT_MODEL.md](THREAT_MODEL.md). It describes the trust boundaries (the
HiveServer2 SQL front door, the Metastore, the query/UDF execution layer), the
adversaries in and out of scope, the security properties Hive upholds given its
deployment assumptions versus those left to the operator (transport security,
authorization-model choice, network isolation, UDF vetting), and the recurring
non-findings. Triagers of scanner, fuzzer, or AI-generated findings should
route each through `THREAT_MODEL.md` §13.

This file is **v0** and carries open questions for the Hive PMC in
`THREAT_MODEL.md` §14.
Loading
Loading