Orchestrator API

llm_expose.core.orchestrator

Orchestrator: ties together an LLM provider and a messaging client.

Orchestrator

Connects an LLM provider to a messaging client.

The orchestrator maintains per-channel conversation history so that the LLM retains context across multiple turns in a single chat session. Each channel can have its own system prompt to provide context about the communication channel being used.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `config` | `ExposureConfig` | The `llm_expose.config.models.ExposureConfig` that describes both the provider and the client to use. | required |
| `provider` | `BaseProvider` | An initialised `llm_expose.providers.base.BaseProvider`. | required |
| `client` | `BaseClient` | An initialised `llm_expose.clients.base.BaseClient`. | required |
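The per-channel history behaviour described above can be sketched without the library itself. The snippet below is an illustrative, self-contained sketch of the pattern (the class and prompt names are hypothetical, not the actual `Orchestrator` API): each channel ID maps to its own message list, seeded with the system prompt, so context accumulates per chat without leaking between channels.

```python
# Illustrative sketch of per-channel conversation histories, keyed by
# channel ID. Mirrors the pattern the Orchestrator uses internally;
# names here are hypothetical.

SYSTEM_PROMPT = "You are exposed over a chat channel."

class HistoryStore:
    def __init__(self) -> None:
        self._histories: dict[str, list[dict[str, str]]] = {}

    def get_or_create(self, channel_id: str) -> list[dict[str, str]]:
        # Each channel starts with its own copy of the system prompt.
        if channel_id not in self._histories:
            self._histories[channel_id] = [
                {"role": "system", "content": SYSTEM_PROMPT}
            ]
        return self._histories[channel_id]

store = HistoryStore()
chat_a = store.get_or_create("chat-a")
chat_a.append({"role": "user", "content": "hello"})

chat_b = store.get_or_create("chat-b")
# Histories are isolated: chat-b still only holds its system prompt.
print(len(chat_a), len(chat_b))  # 2 1
```

Repeated calls with the same channel ID return the same list object, which is what lets multi-turn context accumulate in place.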
Source code in llm_expose/core/orchestrator.py
class Orchestrator:
    """Connects an LLM *provider* to a messaging *client*.

    The orchestrator maintains per-channel conversation history so that the
    LLM retains context across multiple turns in a single chat session. Each
    channel can have its own system prompt to provide context about the
    communication channel being used.

    Args:
        config: The :class:`~llm_expose.config.models.ExposureConfig` that
            describes both the provider and the client to use.
        provider: An initialised :class:`~llm_expose.providers.base.BaseProvider`.
        client: An initialised :class:`~llm_expose.clients.base.BaseClient`.
    """

    def __init__(
        self,
        config: ExposureConfig,
        provider: BaseProvider,
        client: BaseClient,
    ) -> None:
        self._config = config
        self._provider = provider
        self._client = client
        # Per-channel conversation histories (keyed by channel ID).
        # Each channel gets its own history with the configured system prompt.
        self._histories: dict[str, list[Message]] = {}
        # Store path to system prompt file for lazy loading
        self._system_prompt_path = config.client.system_prompt_path
        # Cache the loaded system prompt content
        self._loaded_system_prompt: str | None = None
        # Backward compatibility for older tests/callers that access _history
        # or call _handle_message(user_message) without a channel ID.
        self._default_channel_id = "__default__"
        self._history = self._get_or_create_history(self._default_channel_id)
        self._tool_choice: ToolChoice | None = "auto"
        self._mcp_runtime: MCPRuntimeManager | None = None
        self._mcp_runtime_initialized = False
        self._mcp_runtime_lock = asyncio.Lock()
        self._mcp_settings = MCPSettingsConfig()
        self._pending_approvals: dict[str, _PendingApproval] = {}
        self._approval_ttl_seconds = 600
        self._channel_name = config.channel_name
        self._paired_channel_ids: set[str] = set()
        self._usage_by_channel: dict[str, _UsageStats] = {}
        self._startup_warnings: list[str] = []

        if self._channel_name:
            try:
                self._paired_channel_ids = set(
                    get_pairs_for_channel(self._channel_name)
                )
            except Exception as exc:  # pragma: no cover - defensive logging path
                logger.warning("Failed to load channel pairs: %s", exc)

        try:
            mcp_config = load_mcp_config()
            self._mcp_settings = mcp_config.settings
            attached_server_names = set(config.client.mcp_servers)
            if attached_server_names:
                available_server_names = {server.name for server in mcp_config.servers}
                missing_server_names = sorted(
                    attached_server_names - available_server_names
                )
                if missing_server_names:
                    logger.warning(
                        "Channel '%s' has missing MCP server attachments: %s",
                        config.name,
                        ", ".join(missing_server_names),
                    )

                mcp_config.servers = [
                    server
                    for server in mcp_config.servers
                    if server.name in attached_server_names
                ]
            else:
                mcp_config.servers = []

            if any(server.enabled for server in mcp_config.servers):
                self._mcp_runtime = MCPRuntimeManager(mcp_config)
        except Exception as exc:  # pragma: no cover - defensive logging path
            logger.warning("Failed to load MCP config: %s", exc)

    async def _ensure_mcp_runtime_ready(self) -> None:
        if self._mcp_runtime is None or self._mcp_runtime_initialized:
            return
        async with self._mcp_runtime_lock:
            if self._mcp_runtime_initialized:
                return
            await self._mcp_runtime.initialize()
            self._mcp_runtime_initialized = True

    def _load_system_prompt(self) -> str:
        """Load the system prompt from file, with fallback to default.

        If system_prompt_path is configured:
        - Reads the file and returns its contents (trimmed).
        - If file doesn't exist: logs warning, emits CLI warning, returns default prompt.

        If no path is configured:
        - Returns the default system prompt.

        Caches the result after the first load to avoid repeated file I/O.

        Returns:
            The system prompt text to use.
        """
        # Return cached result if already loaded
        if self._loaded_system_prompt is not None:
            return self._loaded_system_prompt

        # No custom prompt path configured, use default
        if not self._system_prompt_path:
            self._loaded_system_prompt = _DEFAULT_SYSTEM_PROMPT
            return self._loaded_system_prompt

        # Try to read the prompt file
        prompt_file = Path(self._system_prompt_path)
        if not prompt_file.exists():
            warning_msg = f"System prompt file not found: {self._system_prompt_path}"
            logger.warning(warning_msg)
            # Store warning for CLI to display (will be picked up during startup)
            self._startup_warnings.append(warning_msg)
            self._loaded_system_prompt = _DEFAULT_SYSTEM_PROMPT
            return self._loaded_system_prompt

        try:
            content = prompt_file.read_text(encoding="utf-8").strip()
            self._loaded_system_prompt = content if content else _DEFAULT_SYSTEM_PROMPT
            return self._loaded_system_prompt
        except Exception as e:
            error_msg = (
                f"Error reading system prompt file {self._system_prompt_path}: {e}"
            )
            logger.warning(error_msg)
            self._startup_warnings.append(error_msg)
            self._loaded_system_prompt = _DEFAULT_SYSTEM_PROMPT
            return self._loaded_system_prompt

    def _get_or_create_history(self, channel_id: str) -> list[Message]:
        """Get the conversation history for a channel, creating it if needed.

        Args:
            channel_id: Unique identifier for the channel/chat.

        Returns:
            The conversation history list for this channel.
        """
        if channel_id not in self._histories:
            logger.debug("Creating new conversation history for channel %s", channel_id)

            # Build system prompt with MCP server instructions
            system_content = self._load_system_prompt()

            # Add MCP server instructions if runtime is available and initialized
            # Use getattr to handle cases where _mcp_runtime may not be set yet (during __init__)
            mcp_runtime = getattr(self, "_mcp_runtime", None)
            mcp_runtime_initialized = getattr(self, "_mcp_runtime_initialized", False)

            if mcp_runtime and mcp_runtime_initialized:
                mcp_instructions = mcp_runtime.server_instructions
                if mcp_instructions:
                    system_content += (
                        "\n\n## These are the available MCP tools that you can call:\n\n"
                        + mcp_instructions
                    )
                    system_content += "\n\nTo call a tool, you must output only a JSON object in this format:"
                    system_content += (
                        '\n\n{"name": "tool_name", "args": {"arg_name": "value"}}'
                    )
                    system_content += "\n\nIf no tool is needed, respond normally."

            self._histories[channel_id] = [
                {"role": "system", "content": system_content}
            ]
        return self._histories[channel_id]

    async def _handle_message(
        self,
        channel_or_message: str,
        user_message: str | None = None,
        *,
        message_content: Any | None = None,
        message_context: dict[str, Any] | None = None,
    ) -> str | MessageResponse:
        """Process a single user message and return the LLM's reply.

        Appends the user message to the channel's history, calls the provider,
        then appends the assistant's reply and returns it.

        Args:
            channel_or_message: Either channel ID (new style) or user message
                (legacy one-argument style).
            user_message: Raw text from the end user when channel ID is passed.

        Returns:
            The LLM's reply as a plain string or MessageResponse with approval metadata.
        """
        if user_message is None:
            channel_id = self._default_channel_id
            text = channel_or_message
        else:
            channel_id = channel_or_message
            text = user_message

        if not self._is_channel_paired(channel_id):
            return (
                f"This instance is not paired. Run `llm-expose add pair {channel_id}`"
            )

        approval_decision = self._parse_approval_decision(text)
        if approval_decision is not None:
            return await self._handle_approval_decision(channel_id, *approval_decision)

        if channel_id in self._pending_approvals:
            pending = self._pending_approvals[channel_id]
            if self._is_pending_expired(pending):
                del self._pending_approvals[channel_id]
            else:
                return (
                    f"A tool call is waiting for confirmation (id: {pending.approval_id}). "
                    f"Reply `approve {pending.approval_id}` or `reject {pending.approval_id}`."
                )

        history = self._get_or_create_history(channel_id)
        execution_context = self._build_tool_execution_context(
            channel_id,
            message_content=message_content,
            message_context=message_context,
        )
        history.append(
            {
                "role": "user",
                "content": message_content if message_content is not None else text,
            }
        )
        logger.debug(
            "Sending %d messages to provider for channel %s",
            len(history),
            channel_id,
        )

        await self._ensure_mcp_runtime_ready()
        tools = self._mcp_runtime.tools if self._mcp_runtime is not None else []
        if not tools:
            reply = await self._provider.complete(history)
            self._record_provider_usage(channel_id)
            history.append({"role": "assistant", "content": reply})
            return reply

        # Use mixed approval handler which respects per-server confirmation settings
        mixed_reply = await self._handle_message_with_mixed_approval(
            history,
            tools,
            channel_id,
            execution_context=execution_context,
        )
        return mixed_reply

    def _build_tool_execution_context(
        self,
        channel_id: str,
        *,
        message_content: Any | None = None,
        message_context: dict[str, Any] | None = None,
        execution_mode: str = "chat",
    ) -> ToolExecutionContext:
        context = message_context or {}
        chat_type = str(context.get("chat_type") or "").strip().lower() or None
        subject_kind: Literal["user", "group", "chat", "unknown"] = "chat"
        if chat_type == "private":
            subject_kind = "user"
        elif chat_type in {"group", "supergroup", "channel"}:
            subject_kind = "group"

        initiator_user_id = context.get("effective_user_id")
        if initiator_user_id is not None:
            initiator_user_id = str(initiator_user_id)

        platform = context.get("platform") or self._config.client.client_type
        attachments = extract_invocation_attachments(message_content)

        return ToolExecutionContext(
            execution_mode="chat" if execution_mode == "chat" else "one-shot",
            channel_id=channel_id,
            channel_name=self._channel_name,
            subject_id=channel_id,
            subject_kind=subject_kind,
            initiator_user_id=initiator_user_id,
            platform=str(platform) if platform is not None else None,
            chat_type=chat_type,
            attachments=attachments,
            sender=self._client,
        )

    def _is_channel_paired(self, channel_id: str) -> bool:
        """Return whether an incoming channel ID is allowed for this exposure.

        Pair enforcement is active only when a channel namespace was supplied in
        config. This keeps backward compatibility for direct programmatic usage.
        """
        if not self._channel_name:
            return True

        # Reload persisted pair data on each check so runtime add/remove
        # operations are reflected without restarting the exposure process.
        try:
            self._paired_channel_ids = set(get_pairs_for_channel(self._channel_name))
        except Exception as exc:  # pragma: no cover - defensive logging path
            logger.warning("Failed to refresh channel pairs: %s", exc)
            return False

        return channel_id in self._paired_channel_ids

    @staticmethod
    def _parse_approval_decision(text: str) -> tuple[bool, str] | None:
        parts = text.strip().split(maxsplit=1)
        if len(parts) != 2:
            return None
        command = parts[0].lower()
        approval_id = parts[1].strip()
        if command == "approve":
            return True, approval_id
        if command == "reject":
            return False, approval_id
        return None

    def _is_pending_expired(self, pending: _PendingApproval) -> bool:
        return (time.monotonic() - pending.created_at) > self._approval_ttl_seconds

    def _get_tool_confirmation_mode(self, tool_name: str) -> str:
        """Determine confirmation mode for a tool based on its server config.

        Returns:
            "required" if tool needs approval, "optional" if it auto-executes.
        """
        if self._mcp_runtime is None:
            return self._mcp_settings.confirmation_mode

        server_name = self._mcp_runtime.get_tool_server_name(tool_name)
        if server_name is None:
            return self._mcp_settings.confirmation_mode

        server_config = self._mcp_runtime.get_server_config(server_name)
        if server_config is None:
            return self._mcp_settings.confirmation_mode

        # Resolve per-server confirmation setting
        if server_config.tool_confirmation == "default":
            return self._mcp_settings.confirmation_mode
        elif server_config.tool_confirmation == "required":
            return "required"
        else:  # "never"
            return "optional"

    async def _handle_message_with_required_approval(
        self,
        history: list[Message],
        tools: list[ToolSpec],
        channel_id: str,
        execution_context: ToolExecutionContext | None = None,
    ) -> str | MessageResponse:
        assistant_message = await self._provider_complete_message(
            history,
            channel_id=channel_id,
            tools=tools,
            tool_choice=self._tool_choice,
        )
        history.append(assistant_message)

        tool_calls = assistant_message.get("tool_calls") or []
        if not tool_calls:
            return str(assistant_message.get("content") or "")

        # Build server name mapping for the tool calls
        server_names: dict[str, str] = {}
        if self._mcp_runtime is not None:
            for call in tool_calls:
                function_obj = (
                    call.get("function")
                    if isinstance(call, dict)
                    else getattr(call, "function", None)
                )
                if isinstance(function_obj, dict):
                    tool_name = function_obj.get("name")
                else:
                    tool_name = getattr(function_obj, "name", None)
                if tool_name:
                    server_name = self._mcp_runtime.get_tool_server_name(tool_name)
                    if server_name:
                        server_names[tool_name] = server_name

        approval_id = uuid.uuid4().hex[:8]
        self._pending_approvals[channel_id] = _PendingApproval(
            approval_id=approval_id,
            created_at=time.monotonic(),
            tools=tools,
            tool_calls=tool_calls,
            server_names=server_names,
            execution_context=execution_context,
        )
        return self._format_approval_prompt(approval_id, tool_calls, server_names)

    def _format_approval_prompt(
        self,
        approval_id: str,
        tool_calls: list[Any],
        server_names: dict[str, str] | None = None,
    ) -> MessageResponse:
        tool_info: list[str] = []
        tool_names: list[str] = []
        for call in tool_calls:
            function_obj = (
                call.get("function")
                if isinstance(call, dict)
                else getattr(call, "function", None)
            )
            if isinstance(function_obj, dict):
                name = function_obj.get("name")
            else:
                name = getattr(function_obj, "name", None)
            tool_name = str(name or "unknown_tool")
            tool_names.append(tool_name)

            # Add server name if available
            if server_names and tool_name in server_names:
                tool_info.append(f"{server_names[tool_name]}.{tool_name}")
            else:
                tool_info.append(tool_name)

        names = ", ".join(tool_info)
        content = (
            f"Tool execution requires confirmation (id: `{approval_id}`).\n"
            f"Requested tool(s): `{names}`\n"
            f"Reply `approve {approval_id}` to continue or `reject {approval_id}` to cancel."
        )
        return MessageResponse(
            content=content,
            approval_id=approval_id,
            tool_names=tool_names,
            server_names=server_names or {},
        )

    async def _handle_message_with_mixed_approval(
        self,
        history: list[Message],
        tools: list[ToolSpec],
        channel_id: str,
        execution_context: ToolExecutionContext | None = None,
    ) -> str | MessageResponse:
        """Handle message with per-server tool confirmation settings.

        This method splits tool calls into auto-execute and needs-approval groups,
        executes auto-execute tools immediately, and creates pending approvals for
        tools that require confirmation.
        """
        max_tool_rounds = 25

        for _ in range(max_tool_rounds):
            assistant_message = await self._provider_complete_message(
                history,
                channel_id=channel_id,
                tools=tools,
                tool_choice=self._tool_choice,
            )
            history.append(assistant_message)

            tool_calls = assistant_message.get("tool_calls") or []
            if not tool_calls:
                return str(assistant_message.get("content") or "")

            # Split tool calls by confirmation requirement
            auto_execute_calls: list[Any] = []
            needs_approval_calls: list[Any] = []

            for call in tool_calls:
                function_obj = (
                    call.get("function")
                    if isinstance(call, dict)
                    else getattr(call, "function", None)
                )
                if isinstance(function_obj, dict):
                    tool_name = function_obj.get("name")
                else:
                    tool_name = getattr(function_obj, "name", None)

                if tool_name:
                    confirmation_mode = self._get_tool_confirmation_mode(tool_name)
                    if confirmation_mode == "required":
                        needs_approval_calls.append(call)
                    else:
                        auto_execute_calls.append(call)
                else:
                    # Unknown tool name, default to auto-execute
                    auto_execute_calls.append(call)

            # Execute auto-execute tools immediately
            if auto_execute_calls:
                if self._mcp_runtime is None:
                    return "MCP runtime unavailable while handling tool calls."
                await self._execute_tool_calls(
                    history,
                    auto_execute_calls,
                    execution_context=execution_context,
                )

            # If there are tools needing approval, create pending approval
            if needs_approval_calls:
                # Build server name mapping for the tool calls needing approval
                server_names: dict[str, str] = {}
                if self._mcp_runtime is not None:
                    for call in needs_approval_calls:
                        function_obj = (
                            call.get("function")
                            if isinstance(call, dict)
                            else getattr(call, "function", None)
                        )
                        if isinstance(function_obj, dict):
                            tool_name = function_obj.get("name")
                        else:
                            tool_name = getattr(function_obj, "name", None)
                        if tool_name:
                            server_name = self._mcp_runtime.get_tool_server_name(
                                tool_name
                            )
                            if server_name:
                                server_names[tool_name] = server_name

                approval_id = uuid.uuid4().hex[:8]
                self._pending_approvals[channel_id] = _PendingApproval(
                    approval_id=approval_id,
                    created_at=time.monotonic(),
                    tools=tools,
                    tool_calls=needs_approval_calls,
                    server_names=server_names,
                    execution_context=execution_context,
                )
                return self._format_approval_prompt(
                    approval_id, needs_approval_calls, server_names
                )

            # If only auto-execute tools were present, continue the loop

        fallback = (
            "Tool execution exceeded maximum rounds; stopping to avoid infinite loop."
        )
        history.append({"role": "assistant", "content": fallback})
        return fallback

    async def _handle_approval_decision(
        self,
        channel_id: str,
        approved: bool,
        approval_id: str,
    ) -> str:
        pending = self._pending_approvals.get(channel_id)
        if pending is None:
            return "There is no pending tool approval for this chat."

        if self._is_pending_expired(pending):
            del self._pending_approvals[channel_id]
            return "The pending tool approval expired. Please ask again."

        if pending.approval_id != approval_id:
            return (
                f"Unknown approval id '{approval_id}'. "
                f"Pending id is '{pending.approval_id}'."
            )

        history = self._get_or_create_history(channel_id)
        del self._pending_approvals[channel_id]

        if approved:
            if self._mcp_runtime is None:
                return "MCP runtime unavailable while handling approval."
            await self._execute_tool_calls(
                history,
                pending.tool_calls,
                execution_context=pending.execution_context,
                approval_id=approval_id,
            )
            return await self._handle_message_with_mcp_tools(
                history,
                pending.tools,
                execution_context=pending.execution_context,
            )

        for call in pending.tool_calls:
            history.append(
                {
                    "role": "tool",
                    "tool_call_id": self._tool_call_id(call),
                    "content": "User rejected tool execution.",
                }
            )
        reply = await self._provider.complete(history)
        history.append({"role": "assistant", "content": reply})
        return reply

    async def _execute_tool_calls(
        self,
        history: list[Message],
        tool_calls: list[Any],
        *,
        execution_context: ToolExecutionContext | None = None,
        approval_id: str | None = None,
    ) -> None:
        if self._mcp_runtime is None:
            return
        for call in tool_calls:
            call_id = self._tool_call_id(call)
            tool_name = self._tool_call_name(call)
            target_id = self._feedback_target_id(execution_context)
            if tool_name and target_id:
                await self._notify_tool_status(
                    target_id,
                    "running",
                    tool_name,
                    approval_id=approval_id,
                )

            failed = False
            failure_detail: str | None = None
            try:
                tool_result = await asyncio.wait_for(
                    self._mcp_runtime.execute_tool_call(
                        call,
                        execution_context=execution_context,
                    ),
                    timeout=self._mcp_settings.tool_timeout_seconds,
                )
            except TimeoutError:
                tool_result = (
                    "MCP tool execution timed out after "
                    f"{self._mcp_settings.tool_timeout_seconds} seconds."
                )
                failed = True
                failure_detail = tool_result
            except Exception as exc:
                tool_result = f"MCP tool execution failed: {exc}"
                failed = True
                failure_detail = tool_result

            if (
                not failed
                and isinstance(tool_result, str)
                and tool_result.startswith("MCP tool execution failed")
            ):
                failed = True
                failure_detail = tool_result

            if failed and tool_name and target_id:
                await self._notify_tool_status(
                    target_id,
                    "failed",
                    tool_name,
                    approval_id=approval_id,
                    detail=failure_detail,
                )

            history.append(
                {
                    "role": "tool",
                    "tool_call_id": call_id,
                    "content": tool_result,
                }
            )

    async def _notify_tool_status(
        self,
        user_id: str,
        status: str,
        tool_name: str,
        *,
        approval_id: str | None = None,
        detail: str | None = None,
    ) -> None:
        try:
            await self._client.notify_tool_status(
                user_id,
                status,
                tool_name,
                approval_id=approval_id,
                detail=detail,
            )
        except Exception as exc:
            logger.debug(
                "Failed to publish tool status '%s' for '%s': %s",
                status,
                tool_name,
                exc,
            )

    @staticmethod
    def _feedback_target_id(
        execution_context: ToolExecutionContext | None,
    ) -> str | None:
        if execution_context is None:
            return None
        if execution_context.subject_id:
            return execution_context.subject_id
        if execution_context.channel_id:
            return execution_context.channel_id
        return None

    @staticmethod
    def _tool_call_name(tool_call: Any) -> str | None:
        if isinstance(tool_call, dict):
            function_obj = tool_call.get("function")
            if isinstance(function_obj, dict):
                name = function_obj.get("name")
                return str(name) if name else None
            return None

        function_obj = getattr(tool_call, "function", None)
        if function_obj is None:
            return None
        if isinstance(function_obj, dict):
            name = function_obj.get("name")
            return str(name) if name else None
        name = getattr(function_obj, "name", None)
        return str(name) if name else None

    @staticmethod
    def _normalize_usage(raw_usage: Any) -> dict[str, Any] | None:
        if not isinstance(raw_usage, dict):
            return None

        def _coerce_int(value: Any) -> int | None:
            if value is None:
                return None
            try:
                return int(value)
            except (TypeError, ValueError):
                return None

        def _coerce_float(value: Any) -> float | None:
            if value is None:
                return None
            try:
                return float(value)
            except (TypeError, ValueError):
                return None

        prompt_tokens = _coerce_int(raw_usage.get("prompt_tokens"))
        completion_tokens = _coerce_int(raw_usage.get("completion_tokens"))
        total_tokens = _coerce_int(raw_usage.get("total_tokens"))
        cost_usd = _coerce_float(raw_usage.get("cost_usd"))
        latency_ms = _coerce_int(raw_usage.get("latency_ms"))
        model = raw_usage.get("model")

        if (
            total_tokens is None
            and prompt_tokens is not None
            and completion_tokens is not None
        ):
            total_tokens = prompt_tokens + completion_tokens

        if (
            prompt_tokens is None
            and completion_tokens is None
            and total_tokens is None
            and cost_usd is None
        ):
            return None

        return {
            "prompt_tokens": prompt_tokens,
            "completion_tokens": completion_tokens,
            "total_tokens": total_tokens,
            "cost_usd": cost_usd,
            "latency_ms": latency_ms,
            "model": model,
        }

    def _record_provider_usage(self, channel_id: str) -> None:
        get_last_usage = getattr(self._provider, "get_last_usage", None)
        if not callable(get_last_usage):
            return

        raw_usage = get_last_usage()
        usage = self._normalize_usage(raw_usage)
        if usage is None:
            return

        stats = self._usage_by_channel.setdefault(channel_id, _UsageStats())
        stats.last = usage

        prompt_tokens = usage.get("prompt_tokens")
        completion_tokens = usage.get("completion_tokens")
        total_tokens = usage.get("total_tokens")
        cost_usd = usage.get("cost_usd")

        if isinstance(prompt_tokens, int):
            stats.prompt_tokens += prompt_tokens
        if isinstance(completion_tokens, int):
            stats.completion_tokens += completion_tokens
        if isinstance(total_tokens, int):
            stats.total_tokens += total_tokens
        if isinstance(cost_usd, float):
            stats.cost_usd += cost_usd

    async def _provider_complete_message(
        self,
        history: list[Message],
        *,
        channel_id: str,
        tools: list[ToolSpec],
        tool_choice: ToolChoice | None,
    ) -> Message:
        complete_with_message = getattr(self._provider, "complete_with_message", None)
        if callable(complete_with_message):
            maybe_message = complete_with_message(
                history,
                tools=tools,
                tool_choice=tool_choice,
            )
            if inspect.isawaitable(maybe_message):
                message = await maybe_message
                self._record_provider_usage(channel_id)
                if isinstance(message, dict):
                    return message
                if hasattr(message, "model_dump"):
                    dumped = message.model_dump(exclude_none=True)
                    if isinstance(dumped, dict):
                        return dumped
                return {
                    "role": getattr(message, "role", "assistant"),
                    "content": getattr(message, "content", ""),
                    "tool_calls": getattr(message, "tool_calls", None),
                }

        content = await self._provider.complete(
            history,
            tools=tools,
            tool_choice=tool_choice,
        )
        self._record_provider_usage(channel_id)
        return {"role": "assistant", "content": content}

    async def _handle_message_with_mcp_tools(
        self,
        history: list[Message],
        tools: list[ToolSpec],
        *,
        execution_context: ToolExecutionContext | None = None,
    ) -> str:
        """Handle message with auto-execute tools (no approval).

        Delegates to ToolAwareCompletion for reusable tool handling.
        """
        if self._mcp_runtime is None:
            return "MCP runtime unavailable while handling tool calls."

        # Use ToolAwareCompletion with existing runtime
        async with ToolAwareCompletion(
            provider=self._provider,
            mcp_runtime=self._mcp_runtime,
            timeout_seconds=self._mcp_settings.tool_timeout_seconds,
        ) as handler:
            return await handler.complete(
                history,
                execution_context=execution_context,
                max_rounds=8,
            )

    @staticmethod
    def _tool_call_id(tool_call: Any) -> str:
        if isinstance(tool_call, dict):
            return str(tool_call.get("id") or "unknown_tool_call")
        return str(getattr(tool_call, "id", "unknown_tool_call"))

    # ------------------------------------------------------------------
    # Admin command interface (cross-client)
    # ------------------------------------------------------------------

    def _admin_status(self, channel_id: str) -> str:
        """Return a formatted status summary for the given channel."""
        channel_model = getattr(self._config.client, "model_name", None)
        model = channel_model or self._config.provider.model
        exposure = self._config.name

        history = self._histories.get(channel_id, [])
        msg_count = len([m for m in history if m.get("role") != "system"])
        active_channels = len(self._histories)
        pending_count = len(self._pending_approvals)

        lines = [
            f"*📊 Status: `{exposure}`*",
            f"🤖 Model: `{model}`",
            f"📝 History: {msg_count} message(s) in this chat",
            f"📢 Active channels: {active_channels}",
            f"📝 Pending approvals: {pending_count}",
        ]

        if self._mcp_runtime is not None and self._mcp_runtime_initialized:
            tools = self._mcp_runtime.tools
            # Group tools by server
            server_counts: dict[str, int] = {}
            for tool in tools:
                fn = tool.get("function", {})
                name = fn.get("name", "")
                server = self._mcp_runtime.get_tool_server_name(name) or "unknown"
                server_counts[server] = server_counts.get(server, 0) + 1
            summary = ", ".join(f"{s}: {n}" for s, n in server_counts.items())
            lines.append(f"🔧 MCP tools: {len(tools)} ({summary})")
        elif self._mcp_runtime is not None:
            lines.append("🔧 MCP: configured but not yet initialized")
        else:
            lines.append("🔧 MCP: not configured")

        usage_stats = self._usage_by_channel.get(channel_id)
        if usage_stats is None or usage_stats.last is None:
            lines.append("📈 Usage: no provider metrics yet")
            return "\n".join(lines)

        last = usage_stats.last
        last_prompt = last.get("prompt_tokens")
        last_completion = last.get("completion_tokens")
        last_total = last.get("total_tokens")
        lines.append(
            "📈 Usage last: "
            f"p={last_prompt if last_prompt is not None else '-'} "
            f"c={last_completion if last_completion is not None else '-'} "
            f"t={last_total if last_total is not None else '-'}"
        )

        totals_line = (
            "📈 Usage chat: "
            f"p={usage_stats.prompt_tokens} "
            f"c={usage_stats.completion_tokens} "
            f"t={usage_stats.total_tokens}"
        )
        if usage_stats.cost_usd > 0:
            totals_line += f" | est_cost=${usage_stats.cost_usd:.6f}"
        lines.append(totals_line)

        return "\n".join(lines)

    def _admin_clear_history(self, channel_id: str) -> str:
        """Clear conversation history for the given channel."""
        self._histories[channel_id] = []
        self._usage_by_channel.pop(channel_id, None)
        return "✅ Conversation history cleared."

    def _admin_tools(self) -> str:
        """Return a formatted list of available MCP tools grouped by server."""
        if self._mcp_runtime is None or not self._mcp_runtime_initialized:
            return "No MCP tools available (runtime not initialized)."

        tools = self._mcp_runtime.tools
        if not tools:
            return "No MCP tools available."

        # Group by server
        by_server: dict[str, list[str]] = {}
        for tool in tools:
            fn = tool.get("function", {})
            name = fn.get("name", "unknown")
            desc = fn.get("description", "")
            server = self._mcp_runtime.get_tool_server_name(name) or "unknown"
            mode = self._get_tool_confirmation_mode(name)
            tag = "[approval]" if mode == "required" else "[auto]"
            entry = f"  • `{name}` {tag}"
            if desc:
                first_line = desc.split("\n")[0][:80]
                entry += f" — {first_line}"
            by_server.setdefault(server, []).append(entry)

        lines = ["*🔧 MCP Tools*"]
        for server, entries in by_server.items():
            lines.append(f"\n*{server}*")
            lines.extend(entries)
        return "\n".join(lines)

    def _ensure_start_onboarding_context(self, channel_id: str) -> None:
        """Inject onboarding context once per channel when /start is used."""
        history = self._get_or_create_history(channel_id)
        for message in history:
            if message.get("role") != "system":
                continue
            content = message.get("content")
            if isinstance(content, str) and _START_ONBOARDING_CONTEXT_MARKER in content:
                return

        history.append({"role": "system", "content": _START_ONBOARDING_CONTEXT_TEXT})

    def _admin_start(self, channel_id: str) -> str:
        """Return onboarding instructions and inject setup context for this channel."""
        if not self._is_channel_paired(channel_id):
            return (
                f"This instance is not paired. Run `llm-expose add pair {channel_id}`"
            )

        self._ensure_start_onboarding_context(channel_id)
        return _START_ONBOARDING_VISIBLE_TEXT

    async def _admin_reload(self, channel_id: str) -> str:
        """Clear history, reset system prompt cache, and reinitialize MCP."""
        self._histories[channel_id] = []
        self._usage_by_channel.pop(channel_id, None)
        self._loaded_system_prompt = None

        if self._mcp_runtime is not None and self._mcp_runtime_initialized:
            try:
                await self._mcp_runtime.shutdown()
            except Exception as exc:
                logger.warning("Error shutting down MCP runtime during reload: %s", exc)
            self._mcp_runtime_initialized = False

        history = self._get_or_create_history(channel_id)
        history.append({"role": "system", "content": _RELOAD_STARTUP_CONTEXT_TEXT})

        return "✅ Reloaded: history cleared, system prompt reset, MCP will reinitialize on next message."

    @staticmethod
    def _admin_list() -> str:
        """Return the list of available admin commands."""
        return (
            "Available admin commands:\n"
            "/start — begin onboarding and setup guidance\n"
            "/list — show available commands\n"
            "/status — show runtime status\n"
            "/clear — clear conversation history for this chat\n"
            "/tools — list available MCP tools\n"
            "/reload — clear history, reload system prompt and reinitialize MCP"
        )

    async def handle_admin_command(
        self, channel_id: str, command: str, args: list[str] | None = None
    ) -> str:
        """Dispatch an admin command and return the formatted response.

        This is the single cross-client entry point for admin commands. Any
        client (Telegram, Discord, Slack, …) can call this method with the
        bare command name and receive a ready-to-send string response.

        Args:
            channel_id: The channel that issued the command (for scoping).
            command: Bare command name, e.g. ``"status"``, ``"clear"``.
            args: Optional list of additional arguments (reserved for future use).

        Returns:
            A formatted string ready to be sent back to the user.
        """
        cmd = command.lower().strip()
        if cmd == "start":
            return self._admin_start(channel_id)
        if cmd == "list":
            return self._admin_list()
        if cmd == "status":
            return self._admin_status(channel_id)
        if cmd == "clear":
            return self._admin_clear_history(channel_id)
        if cmd == "tools":
            return self._admin_tools()
        if cmd == "reload":
            return await self._admin_reload(channel_id)
        return "Unknown command. Use /list to see available admin commands."

    async def run(self) -> None:
        """Start the client and block until it stops.

        Registers :meth:`_handle_message` as the client's message handler and
        delegates lifecycle management to the client.
        """
        # Wire our handler into the client via the public setter
        self._client.set_handler(self._handle_message)
        logger.info("Starting orchestrator for exposure '%s'", self._config.name)
        try:
            await self._client.start()
        finally:
            if self._mcp_runtime is not None:
                await self._mcp_runtime.shutdown()
            # Always attempt a graceful shutdown to release client resources
            # (e.g., Telegram long-polling session) even on Ctrl+C/errors.
            await self._client.stop()

handle_admin_command(channel_id, command, args=None) async

Dispatch an admin command and return the formatted response.

This is the single cross-client entry point for admin commands. Any client (Telegram, Discord, Slack, …) can call this method with the bare command name and receive a ready-to-send string response.

Parameters:

| Name | Type | Description | Default |
|------|------|-------------|---------|
| `channel_id` | `str` | The channel that issued the command (for scoping). | required |
| `command` | `str` | Bare command name, e.g. `"status"`, `"clear"`. | required |
| `args` | `list[str] \| None` | Optional list of additional arguments (reserved for future use). | `None` |

Returns:

| Type | Description |
|------|-------------|
| `str` | A formatted string ready to be sent back to the user. |

Source code in llm_expose/core/orchestrator.py
async def handle_admin_command(
    self, channel_id: str, command: str, args: list[str] | None = None
) -> str:
    """Dispatch an admin command and return the formatted response.

    This is the single cross-client entry point for admin commands. Any
    client (Telegram, Discord, Slack, …) can call this method with the
    bare command name and receive a ready-to-send string response.

    Args:
        channel_id: The channel that issued the command (for scoping).
        command: Bare command name, e.g. ``"status"``, ``"clear"``.
        args: Optional list of additional arguments (reserved for future use).

    Returns:
        A formatted string ready to be sent back to the user.
    """
    cmd = command.lower().strip()
    if cmd == "start":
        return self._admin_start(channel_id)
    if cmd == "list":
        return self._admin_list()
    if cmd == "status":
        return self._admin_status(channel_id)
    if cmd == "clear":
        return self._admin_clear_history(channel_id)
    if cmd == "tools":
        return self._admin_tools()
    if cmd == "reload":
        return await self._admin_reload(channel_id)
    return "Unknown command. Use /list to see available admin commands."

run() async

Start the client and block until it stops.

Registers :meth:_handle_message as the client's message handler and delegates lifecycle management to the client.

Source code in llm_expose/core/orchestrator.py
async def run(self) -> None:
    """Start the client and block until it stops.

    Registers :meth:`_handle_message` as the client's message handler and
    delegates lifecycle management to the client.
    """
    # Wire our handler into the client via the public setter
    self._client.set_handler(self._handle_message)
    logger.info("Starting orchestrator for exposure '%s'", self._config.name)
    try:
        await self._client.start()
    finally:
        if self._mcp_runtime is not None:
            await self._mcp_runtime.shutdown()
        # Always attempt a graceful shutdown to release client resources
        # (e.g., Telegram long-polling session) even on Ctrl+C/errors.
        await self._client.stop()
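
The `try`/`finally` in `run` guarantees that `stop()` releases client resources even when the polling loop crashes. A self-contained sketch of that lifecycle contract, using a hypothetical stub client rather than a real messaging backend:

```python
import asyncio


class StubClient:
    """Hypothetical client recording the start/stop lifecycle run() relies on."""

    def __init__(self, fail: bool = False) -> None:
        self.fail = fail
        self.events: list[str] = []

    def set_handler(self, handler) -> None:
        self.events.append("handler set")

    async def start(self) -> None:
        self.events.append("started")
        if self.fail:
            raise RuntimeError("poll loop crashed")

    async def stop(self) -> None:
        self.events.append("stopped")


async def run(client: StubClient) -> None:
    client.set_handler(lambda *args: None)
    try:
        await client.start()
    finally:
        # Mirrors Orchestrator.run(): always attempt a graceful shutdown.
        await client.stop()


client = StubClient(fail=True)
try:
    asyncio.run(run(client))
except RuntimeError:
    pass
print(client.events)
# → ['handler set', 'started', 'stopped']
```

Even though `start()` raised, `stop()` still ran, which is the property the real orchestrator depends on to release, for example, a Telegram long-polling session on Ctrl+C.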