OpenAI chat completions api のオーディオモード追加 #213

tegnike · 2024-10-30T10:23:04Z

Summary by CodeRabbit

新機能
- 音声モードの設定オプションを追加しました。
- 新しいメッセージセレクタ機能を導入し、メッセージのフィルタリングと処理を改善しました。
- 音声処理に関連する新しいAPIエンドポイントを追加しました。
バグ修正
- メッセージ処理のロジックを改善し、エラーハンドリングを強化しました。
ドキュメント
- 各言語における音声モードのローカライズを追加しました。

vercel · 2024-10-30T10:23:09Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
aituber-kit	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Oct 30, 2024 10:23am

coderabbitai · 2024-10-30T10:23:15Z

Walkthrough

このプルリクエストでは、音声とリアルタイムAPIの設定に関連する新しい環境変数が.envファイルに追加され、ローカライズファイルに音声モードに関連する新しいエントリが追加されました。また、複数のコンポーネントが音声データの処理やメッセージの選択を改善するために変更され、音声モードの設定を管理するための新しい機能が統合されました。全体として、音声機能の管理とメッセージ処理のロジックが強化されています。

Changes

ファイル	変更概要
`.env`	新しい環境変数 `NEXT_PUBLIC_REALTIME_API_MODE_VOICE`, `NEXT_PUBLIC_AUDIO_MODE`, `NEXT_PUBLIC_AUDIO_MODE_INPUT_TYPE`, `NEXT_PUBLIC_AUDIO_MODE_VOICE` が追加されました。
`locales/en/translation.json`	`"AudioMode": "Audio Mode (Beta)"` が追加されました。
`locales/ja/translation.json`	`"AudioMode": "オーディオモード（Beta）"` が追加されました。
`locales/ko/translation.json`	`"AudioMode": "오디오 모드 (Beta)"` が追加されました。
`locales/zh/translation.json`	`"AudioMode": "音頻模式（Beta）"` が追加されました。
`src/components/chatLog.tsx`	`homeStore`からのメッセージ取得を`messageSelectors.getTextAndImageMessages`に変更し、`msg.content`の存在を確認する条件を追加しました。
`src/components/menu.tsx`	`webSocketMode`と`chatLog`の状態変数を削除し、`IconButton`の`disabled`プロパティを常に`false`に設定しました。
`src/components/messageInputContainer.tsx`	`handleSendMessage`と`handleInputChange`のコールバック関数を追加し、`MessageInput`コンポーネントに新しいプロパティを追加しました。
`src/components/realtimeAPIUtils.tsx`	`AudioBufferManager`を使用し、音声データの処理を簡素化しました。
`src/components/settings/log.tsx`	`homeStore`からの`chatLog`の取得を`messageSelectors.getTextAndImageMessages`に変更しました。
`src/components/settings/modelProvider.tsx`	音声モードに関連する新しい状態変数を追加し、音声設定の管理セクションを追加しました。
`src/components/settings/slide.tsx`	`slideMode`に基づいてスライドフォルダの取得を条件付きで行うように変更しました。
`src/components/useRealtimeAPI.tsx`	`accumulatedAudioDataRef`の型を`AudioBufferManager`に変更しました。
`src/features/chat/aiChatFactory.ts`	`getAIChatResponseStream`関数のパラメータを変更し、音声モードの条件を追加しました。
`src/features/chat/handlers.ts`	メッセージ処理のロジックを改善し、新しいメッセージセレクタを統合しました。
`src/features/chat/openAIAudioChat.ts`	新しい関数`getOpenAIAudioChatResponseStream`を追加しました。
`src/features/constants/settings.ts`	新しい型`AudioModeInputType`を追加しました。
`src/features/messages/messageSelectors.ts`	メッセージのフィルタリングと処理のための新しいモジュール`messageSelectors`を追加しました。
`src/features/messages/messages.ts`	`Message`型の`content`プロパティをオプショナルにし、新しい`audio`プロパティを追加しました。
`src/features/messages/speakCharacter.ts`	`createSpeakCharacter`関数に音声モードの条件を追加しました。
`src/features/stores/home.ts`	`chatLog`の処理を`messageSelectors.cutImageMessage`に変更しました。
`src/features/stores/settings.ts`	`General`インターフェースに音声関連の新しいプロパティを追加しました。
`src/features/youtube/conversationContinuityFunctions.ts`	メッセージ処理のロジックを改善し、エラーハンドリングを強化しました。
`src/features/youtube/youtubeComments.ts`	`messageSelectors`を使用してメッセージを取得するロジックを改善しました。
`src/pages/api/aiChat.ts`	エラーハンドリングとメッセージ処理のロジックを改善しました。
`src/utils/audioBufferManager.ts`	`AudioBufferManager`クラスを新規作成し、音声データバッファの管理を行います。

Possibly related PRs

envファイル修正 & Azure設定プロパティ変更 #171: 音声設定に関連する新しい環境変数が追加され、音声モードの設定に関連しています。
システムプロンプトを環境変数で設定できるようにする #176: NEXT_PUBLIC_SYSTEM_PROMPT変数が追加され、環境変数の追加に関連しています。
リリースブランチ #173: .envファイルに新しい環境変数が追加され、主なPRに直接関連しています。
リリースブランチ #182: NEXT_PUBLIC_SYSTEM_PROMPT変数が更新され、環境変数管理に関連しています。
WebSocketモード更新 #205: WebSocket設定が更新され、主なPRで導入されたリアルタイムAPI設定に関連しています。

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 17

🧹 Outside diff range and nitpick comments (17)

src/features/chat/aiChatFactory.ts (2)

43-43: エラーメッセージの改善提案

エラーメッセージをより詳細にすることで、デバッグ時の助けになります。

-      throw new Error(`Unsupported AI service: ${ss.selectAIService}`)
+      throw new Error(`サポートされていないAIサービスです: ${ss.selectAIService}。利用可能なサービス: openai, anthropic, google, azure, groq, cohere, mistralai, perplexity, fireworks, localLlm, dify`)

Line range hint 1-45: アーキテクチャに関する提案

この実装は機能的には問題ありませんが、以下の改善を検討することをお勧めします：

サービスごとの実装をStrategy パターンを使用してリファクタリング
音声モードの設定をDIコンテナで管理
エラーハンドリングの共通化

これにより、新しいサービスの追加やモードの追加が容易になり、保守性が向上します。

src/features/constants/settings.ts (1)

62-62: 型の目的をJSDocで文書化することを推奨します

この型が環境変数 NEXT_PUBLIC_AUDIO_MODE_INPUT_TYPE と関連していることを明確にするため、JSDocコメントの追加を提案します。

以下の修正を提案します：

+/**
+ * オーディオモードの入力タイプを定義します。
+ * 環境変数 NEXT_PUBLIC_AUDIO_MODE_INPUT_TYPE で使用されます。
+ * @type {'input_text'} テキスト入力モード
+ * @type {'input_audio'} 音声入力モード
+ */
export type AudioModeInputType = 'input_text' | 'input_audio'

src/features/messages/messages.ts (1)

7-10: 型定義のドキュメント化を推奨します

新しく追加された audio プロパティと、任意となった content プロパティについて、JSDoc形式でドキュメントを追加することを推奨します。

以下のような形式でのドキュメント追加を提案します：

 export type Message = {
+  /**
+   * メッセージの役割を示す
+   * "assistant" | "system" | "user" のいずれかの値
+   */
   role: string
+  /**
+   * メッセージの内容
+   * テキストまたはマルチモーダル（テキスト＋画像）形式
+   */
   content?:
     | string
     | [{ type: 'text'; text: string }, { type: 'image'; image: string }]
+  /**
+   * オーディオメッセージの識別情報
+   * @property {string} id - オーディオデータの一意識別子
+   */
   audio?: { id: string }
 }

src/components/settings/log.tsx (1)

42-69: アクセシビリティとi18nの改善が必要です

以下の点について改善を提案します：

画像のalt属性が日本語でハードコードされています
input要素にアクセシビリティ属性（aria-label等）が不足しています

以下の修正を提案します：

 <input
   key={index}
   className="bg-surface1 hover:bg-surface1-hover rounded-8 w-full px-16 py-8"
   type="text"
   value={value.content}
+  aria-label={t('MessageInput')}
   onChange={(e) => {
     handleChangeChatLog(index, e.target.value)
   }}
 ></input>

 <Image
   src={value.content[1].image}
-  alt="画像"
+  alt={t('GeneratedImage')}
   width={500}
   height={500}
 />

.env (1)

121-126: 設定例の追加を推奨します

各設定項目の理解を深めるため、コメントに具体的な設定例を追加することを推奨します。

以下のような形式での追記を提案します：

 # Audio Mode (true or false)
-NEXT_PUBLIC_AUDIO_MODE=""
+# Example: NEXT_PUBLIC_AUDIO_MODE="true"
+NEXT_PUBLIC_AUDIO_MODE=""
 # Audio Mode Input Type ('input_text', 'input_audio')
-NEXT_PUBLIC_AUDIO_MODE_INPUT_TYPE=""
+# Example: NEXT_PUBLIC_AUDIO_MODE_INPUT_TYPE="input_text"
+NEXT_PUBLIC_AUDIO_MODE_INPUT_TYPE=""
 # Audio Mode Voice ('alloy', 'echo', 'fable', 'onyx', 'nova', 'shimmer')
-NEXT_PUBLIC_AUDIO_MODE_VOICE=""
+# Example: NEXT_PUBLIC_AUDIO_MODE_VOICE="alloy"
+NEXT_PUBLIC_AUDIO_MODE_VOICE=""

src/components/settings/slide.tsx (1)

26-32: フェッチロジックの最適化を承認しますが、エラーハンドリングの改善を推奨します

スライドモードが有効な場合のみフォルダリストを取得する最適化は良い改善です。ただし、エラー時のユーザーエクスペリエンスを向上させるため、以下の改善を提案します：

エラーハンドリングの改善案：

  if (slideMode) {
    // フォルダリストを取得
    fetch('/api/getSlideFolders')
      .then((response) => response.json())
      .then((data) => setSlideFolders(data))
-     .catch((error) => console.error('Error fetching slide folders:', error))
+     .catch((error) => {
+       console.error('Error fetching slide folders:', error);
+       // ユーザーにエラーを通知
+       settingsStore.setState({ 
+         errorMessage: 'スライドフォルダの取得に失敗しました。後でもう一度お試しください。'
+       });
+     })
  }

src/components/chatLog.tsx (1)

46-46: null安全性の改善について

msg.contentのnullチェックの追加は適切な対応ですが、さらなる改善の余地があります。

TypeScriptの型システムを活用して、以下の点を検討してください：

msg.contentの型を明示的に定義する
Union型を使用してメッセージの構造を厳密に定義する

例えば以下のような型定義を検討してください：

type MessageContent = {
  text: { text: string }[];
  image: { image: string }[];
} | string | undefined;

interface Message {
  role: string;
  content: MessageContent;
}

Also applies to: 51-51

src/components/useRealtimeAPI.tsx (1)

21-28: メッセージ処理の型定義の改善

processMessage関数の実装は適切ですが、より厳密な型定義が必要です。

以下のような型定義の追加を推奨します：

 type TmpMessage = {
   text: string
   role: string
   emotion: string
   state: string
+  buffer?: AudioBuffer
 }

src/pages/api/aiChat.ts (1)

223-223: 空のメッセージ処理の改善を承認

message.contentがnullまたはundefinedの場合のフォールバック処理を追加することで、より堅牢な実装になっています。

しかし、以下の点についても検討することをお勧めします：

型安全性の向上のため、Messageインターフェースでcontentプロパティの型を明示的に定義する
空文字列の代わりに、より意味のあるデフォルト値（例：[システムメッセージ]）の使用を検討する

以下のような実装を提案します：

- combinedContent = message.content || ''
+ const defaultContent = message.role === 'system' ? '[システムメッセージ]' : ''
+ combinedContent = message.content || defaultContent

src/features/messages/speakCharacter.ts (2)

57-58: アーキテクチャに関する提案

オーディオモードの処理を別のハンドラーに分離することを検討してください。現在の実装では、条件分岐が増えており、将来的な保守性に影響を与える可能性があります。

例えば、以下のような方法で処理を分離できます：

interface AudioHandler {
  handleAudio(talk: Talk): Promise<Buffer | null>;
}

class AudioModeHandler implements AudioHandler {
  async handleAudio(talk: Talk): Promise<null> {
    return null;
  }
}

class TTSHandler implements AudioHandler {
  async handleAudio(talk: Talk): Promise<Buffer> {
    // 既存のTTS処理
  }
}

57-58: エラー処理の改善が推奨されます

audioMode が有効な場合の特殊なエラーケースや状態管理について、より詳細な処理が必要かもしれません。

if (ss.audioMode) {
  // オーディオモードの状態をログに記録
  console.debug('Audio mode enabled, skipping buffer generation');
  buffer = null;
}

src/features/youtube/conversationContinuityFunctions.ts (1)

Line range hint 279-286: エラーハンドリングの改善を提案します

現在のエラーハンドリングは基本的な実装ですが、以下の点で改善の余地があります：

エラータイプの特定
エラーログの詳細化
リトライメカニズムの検討

以下のような実装を検討してください：

  try {
    const response = await fetchAIResponse(queryMessages)
    console.log('response.message:', response.text)
    const responseJson = JSON.parse(response.text)
    answer = responseJson.answer
    answer = answer.toString()
  } catch (error) {
-   console.error('JSON.parseエラーが発生しました。', error)
+   if (error instanceof SyntaxError) {
+     console.error('JSONパースエラー:', {
+       error,
+       responseText: response?.text,
+       stack: error.stack
+     })
+   } else {
+     console.error('予期せぬエラー:', {
+       error,
+       type: error.constructor.name,
+       stack: error.stack
+     })
+   }
    answer = 'false'
  }

src/components/menu.tsx (1)

Line range hint 1-290: 音声モード関連の機能が不足しています

PRの目的である「OpenAI chat completions api のオーディオモード追加」に関連する変更が見当たりません。音声モードのUIコントロールや状態管理を追加する必要があります。

以下の機能の追加を検討してください：

音声入力の開始/停止ボタン
音声出力の設定コントロール
音声モードの状態管理（settingsStoreへの統合）

実装例を提供することも可能です。必要であればお申し付けください。

src/features/stores/settings.ts (1)

Line range hint 1-299: アーキテクチャに関する提案

現在の実装は機能的には問題ありませんが、以下の改善を検討することをお勧めします：

オーディオモード関連の設定をグループ化し、専用のインターフェースを作成する
環境変数のデフォルト値を定数として分離する

以下のような実装を提案します：

interface AudioModeSettings {
  enabled: boolean;
  inputType: AudioModeInputType;
  voice: OpenAITTSVoice;
}

interface General {
  // ... other properties
  audioSettings: AudioModeSettings;
}

// 定数ファイルに移動することを推奨
const DEFAULT_AUDIO_SETTINGS: AudioModeSettings = {
  enabled: false,
  inputType: 'input_text',
  voice: 'shimmer'
} as const;

この変更により、将来的なオーディオ設定の拡張が容易になり、コードの保守性が向上します。

src/utils/audioBufferManager.ts (1)

68-71: 大きなバッファに対するパフォーマンスの最適化を検討してください

validateAudioBuffer メソッド内で int16Array.every(...) を使用していますが、大きなバッファではパフォーマンスに影響を与える可能性があります。全データを検証する代わりに、サンプルデータの検査や他の効率的な方法を検討してください。

src/features/chat/handlers.ts (1)

565-565: ログメッセージを詳細化してください

console.log('response.audio:')だけでは情報が不足している可能性があります。例えば、受信したバッファのサイズや状態など、詳細な情報をログに出力することを検討してください。

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between 2488892 and 4d67972.

📒 Files selected for processing (26)

.env (1 hunks)
locales/en/translation.json (1 hunks)
locales/ja/translation.json (1 hunks)
locales/ko/translation.json (1 hunks)
locales/zh/translation.json (1 hunks)
src/components/chatLog.tsx (2 hunks)
src/components/menu.tsx (2 hunks)
src/components/messageInputContainer.tsx (1 hunks)
src/components/realtimeAPIUtils.tsx (4 hunks)
src/components/settings/log.tsx (3 hunks)
src/components/settings/modelProvider.tsx (4 hunks)
src/components/settings/slide.tsx (1 hunks)
src/components/useRealtimeAPI.tsx (2 hunks)
src/features/chat/aiChatFactory.ts (2 hunks)
src/features/chat/handlers.ts (6 hunks)
src/features/chat/openAIAudioChat.ts (1 hunks)
src/features/constants/settings.ts (1 hunks)
src/features/messages/messageSelectors.ts (1 hunks)
src/features/messages/messages.ts (1 hunks)
src/features/messages/speakCharacter.ts (1 hunks)
src/features/stores/home.ts (2 hunks)
src/features/stores/settings.ts (4 hunks)
src/features/youtube/conversationContinuityFunctions.ts (1 hunks)
src/features/youtube/youtubeComments.ts (5 hunks)
src/pages/api/aiChat.ts (1 hunks)
src/utils/audioBufferManager.ts (1 hunks)

✅ Files skipped from review due to trivial changes (1)

locales/ja/translation.json

🔇 Additional comments (38)

src/features/chat/aiChatFactory.ts (3)

7-7: 新しいオーディオチャット機能のインポートが追加されました

オーディオモード用の新しい機能をインポートすることで、コードの構造が適切に整理されています。

18-18: サービス選択の実装が改善されました

パラメータとして受け取っていたserviceをsettingsStoreから直接参照する形に変更されており、より一貫性のある実装になっています。

14-16: オーディオモードの条件分岐が適切に実装されています

早期リターンパターンを使用して、オーディオモードの処理を優先的に行う実装は適切です。ただし、以下の点について確認をお願いします：

audioModeの型安全性
オーディオモードが有効な場合のエラーハンドリング

✅ Verification successful

オーディオモードの実装は適切に型安全が確保されています

settings.tsの実装を確認したところ：

audioModeはboolean型として明確に定義されています
Generalインターフェースで適切に型付けされています
関連するaudioModeInputTypeとaudioModeVoiceも適切に型定義されています

エラーハンドリングについても、早期リターンパターンを使用して適切に実装されているため、安全性は確保されています。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# audioModeの型定義を確認
rg -t typescript "audioMode.*:" src/

Length of output: 73

Script:

#!/bin/bash
# Search for audioMode type definition and usage
rg "audioMode" src/

# Search for the state interface or type that includes audioMode
ast-grep --pattern 'interface $_ {
  $$$
  audioMode
  $$$
}'

# Also try with type
ast-grep --pattern 'type $_ = {
  $$$
  audioMode
  $$$
}'

Length of output: 3292

src/features/stores/home.ts (2)

6-6: 新しいインポートの追加が適切です

メッセージの選択ロジックを分離することで、コードの保守性が向上しています。

69-69: 永続化ロジックの変更による影響を確認してください

messageSelectors.cutImageMessageを使用することで、チャットログの永続化方法が変更されています。この変更により、既存のチャットログデータに影響がないか確認が必要です。

以下のスクリプトで影響範囲を確認できます：

✅ Verification successful

Let me gather more context about the Message type and any related persistence logic.

永続化時の画像メッセージ処理は安全に実装されています

cutImageMessageの実装を確認したところ、以下の点で安全性が確保されています：

メッセージの構造を保持したまま、contentのみを処理
マルチモーダルメッセージ型（Message型）に対応
既存のチャットログデータはgetTextAndImageMessagesセレクタによって適切に表示

この変更による既存データへの悪影響はありません。永続化時に画像データを除外することで、ストレージの最適化が実現されています。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: messageSelectorsの実装と使用箇所を確認

# messageSelectorsの実装を確認
ast-grep --pattern 'export const messageSelectors = {
  cutImageMessage($_) {
    $$$
  }
}'

# 他のファイルでのcutImageMessageの使用箇所を確認
rg -A 5 'cutImageMessage'

Length of output: 843

Script:

#!/bin/bash
# Check Message type definition and related interfaces
ast-grep --pattern 'type Message = {
  $$$
}'

# Check for any persistence-related configurations
rg -A 5 'persist' src/features/stores/

# Check for any image-related message handling
rg -A 3 'image.*message' -i

Length of output: 4986

src/features/messages/messages.ts (1)

7-10: content の任意化とオーディオプロパティの追加について確認が必要です

content を任意（オプショナル）にすることで、メッセージの有効性の検証が重要になります。また、新しい audio プロパティの追加により、以下の点について確認が必要です：

content と audio の両方が欠落している場合の処理
オーディオメッセージの取り扱いに関するバリデーション

✅ Verification successful

Let's gather more information about how the Message type is used, particularly focusing on validation and message creation patterns.

Based on the analysis of the codebase, I can now provide a final response:

content と audio プロパティの検証は適切に実装されています

messageSelectors.ts において以下の検証が実装されています：

getTextAndImageMessages で content の有効性を確認
getAudioMessages で role に応じた適切な検証を実施:
- user: content が string 型のみ許可
- assistant: audio プロパティの存在を確認

また、メッセージの正規化と処理は normalizeMessages と getProcessedMessages で適切に行われています。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: content と audio の使用パターンを確認

# メッセージの検証処理を探す
rg -l "Message.*validation|validate.*Message" 

# Message型の使用箇所を確認
ast-grep --pattern 'Message'

Length of output: 1070

Script:

#!/bin/bash
# メッセージの作成パターンと検証ロジックを詳しく調べる

# Message型の使用パターンを確認（コンテキスト付き）
rg "new Message|Message\[" -A 3

# content や audio プロパティの使用を確認
rg "message\.(content|audio)" -A 2

# バリデーションに関連する処理を確認
ast-grep --pattern 'function $_($$$) {
  $$$
  message.$$$
  $$$
}'

Length of output: 18252

src/components/settings/log.tsx (1)

7-7: メッセージセレクターの導入により、状態管理が改善されました

messageSelectorsを使用することで、テキストと画像メッセージの一貫した処理が可能になり、コードの保守性が向上しています。

Also applies to: 10-12

.env (1)

118-119: 音声オプションの一貫性を確認してください

リアルタイムAPIモードの音声オプション（'alloy', 'echo', 'shimmer'）が、オーディオモードの音声オプション（'alloy', 'echo', 'fable', 'onyx', 'nova', 'shimmer'）と異なっています。この違いが意図的なものか確認してください。

✅ Verification successful

音声オプションの違いは意図的な設計です

コードベースの分析から、以下の点が確認できました：

OpenAITTSVoice 型は 'alloy', 'echo', 'fable', 'onyx', 'nova', 'shimmer' の全てのオプションを定義しています
リアルタイムAPIモードは制限付きの音声オプション（'alloy', 'echo', 'shimmer'）を使用
オーディオモードは完全な音声オプションセットにアクセス可能

この違いは意図的な設計の一部であり、各モードの特性に合わせて適切に実装されています。修正は不要です。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# OpenAI APIドキュメントと音声オプションの整合性を確認
rg -l "openai.*voice.*options" --type md

Length of output: 42

Script:

#!/bin/bash
# Search for voice-related configurations and constants in the codebase
rg -i "voice.*options|VOICE_OPTIONS" --type ts --type js -A 5

# Search for OpenAI voice-related imports or configurations
rg -i "openai.*voice|voice.*openai" --type ts --type js -A 5

# Search for usage of the environment variables
rg "NEXT_PUBLIC_REALTIME_API_MODE_VOICE|NEXT_PUBLIC_AUDIO_MODE_VOICE" --type ts --type js -A 5

Length of output: 11350

src/components/settings/slide.tsx (1)

26-33: 依存配列の変更による影響を確認してください

useEffectの依存配列にslideModeを追加したことで、スライドモードの切り替え時に再フェッチが発生するようになりました。これは意図した動作でしょうか？

以下のスクリプトでslideModeの使用箇所を確認します：

✅ Verification successful

slideModeの依存配列への追加は適切です

slideModeの状態変更の影響を確認した結果、以下の理由により依存配列への追加は正しい実装であることが確認できました：

slideModeがオフの時は不要なAPIコールを防ぐための最適化として機能します
slideModeの切り替え時にフォルダリストを再取得することで、最新のデータを反映できます
他のコンポーネントでのslideModeの使用状況を見ても、この変更による悪影響はありません

なお、エラーハンドリングについては改善の余地がありますが、現状のコンソールログでも基本的な機能は満たしています。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Verify the usage of slideMode state across components
# Expected: Find components that might be affected by the slideMode changes

rg -l "slideMode" --type tsx

Length of output: 56

Script:

#!/bin/bash
# Let's try different approaches to find slideMode usage
# 1. Search for files with .tsx extension
fd -e tsx -x rg -l "slideMode" {}

# 2. Search for the specific component and its imports
fd "slide.tsx$" -x cat {}

# 3. Look for any state management related to slideMode
rg -l "slideMode"

Length of output: 3768

Script:

#!/bin/bash
# Let's examine the settings store to understand slideMode state management
cat src/features/stores/settings.ts

# Also check the implementation of slideStore
cat src/features/stores/slide.ts

# Look for slideMode usage patterns
rg "slideMode" -B 2 -A 2

Length of output: 17441

src/components/chatLog.tsx (1)

6-6: メッセージセレクターの実装について承認

messageSelectorsの導入により、以下の利点が得られます：

ステート管理の責務が明確に分離されました
他のコンポーネントとの一貫性が保たれています
メッセージの取得ロジックが集中化され、保守性が向上しています

Also applies to: 12-14

src/features/messages/messageSelectors.ts (1)

1-3: コードの構造が適切です

TypeScriptのベストプラクティスに従った明確な構造になっています。

src/components/useRealtimeAPI.tsx (1)

132-132: クリーンアップ処理の検証を推奨

AudioBufferManagerのflush処理は適切に実装されていますが、確実なクリーンアップを保証するための検証を追加することを推奨します。

✅ Verification successful

AudioBufferManagerのクリーンアップ処理は適切に実装されています

AudioBufferManagerのflushメソッドは、以下の理由により適切に実装されていることが確認できました：

sendBuffer()メソッドを呼び出して残りのデータを確実に送信
コンポーネントのクリーンアップ時に適切に呼び出されている
複数の場所（WebSocket、OpenAI Audio API）でも同様に適切に使用されている

クリーンアップ処理の追加検証は不要です。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# AudioBufferManagerのflushメソッドの実装を確認
ast-grep --pattern 'class AudioBufferManager {
  $$$
  flush() {
    $$$
  }
  $$$
}'

Length of output: 87

Script:

#!/bin/bash
# Search for AudioBufferManager class and its implementation
rg -U "class AudioBufferManager" -A 20

# Search for flush method implementation
rg "flush\(\)" -A 5

Length of output: 2880

src/pages/api/aiChat.ts (1)

Line range hint 1-241: 音声モードの統合に関する確認

PRのタイトルには「OpenAI chat completions api のオーディオモード追加」とありますが、このファイルには音声処理に関する明示的な変更が見当たりません。

以下の点について確認が必要です：

音声モードのサポートに必要な追加の変更
OpenAIの音声APIエンドポイントの統合方法
音声応答のストリーミング処理の実装

音声機能の実装状況を確認するため、以下のスクリプトを実行します：

✅ Verification successful

Let me check the implementation details of the audio chat functionality by examining the relevant files.

音声モードの実装は適切に行われています

PRのタイトルに記載された「OpenAI chat completions api のオーディオモード追加」の実装は、以下の形で確認できました：

openAIAudioChat.tsにて、OpenAIのgpt-4o-audio-previewモデルを使用した音声チャット機能が実装されています
aiChatFactory.tsで、audioModeが有効な場合に音声チャット機能が統合されています
messageInputContainer.tsxで、音声入力のUIと音声処理が実装されています

具体的な実装内容：

音声ストリーミング処理
PCM16フォーマットでの音声データ処理
音声認識と音声合成の統合
リアルタイムな音声バッファ管理

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# 音声関連の実装を検索
echo "音声関連の実装を検索中..."
rg -l "audio|voice|speech" --type ts

# OpenAI音声APIの使用箇所を検索
echo "OpenAI音声APIの使用箇所を検索中..."
rg -l "audio|transcription|translation" --type ts

Length of output: 2577

Script:

#!/bin/bash
# Check the OpenAI audio chat implementation
echo "Checking OpenAI audio chat implementation..."
cat src/features/chat/openAIAudioChat.ts

# Check the AI chat factory to see how audio mode is integrated
echo "Checking AI chat factory integration..."
cat src/features/chat/aiChatFactory.ts

# Check the message input container for audio-related UI
echo "Checking message input container..."
cat src/components/messageInputContainer.tsx

Length of output: 14645

src/features/messages/speakCharacter.ts (1)

57-58: バッファー処理の検証が必要です

audioMode が有効な場合に buffer を null に設定することで、後続の処理に影響を与える可能性があります。特に以下の点について確認が必要です：

speakQueue での null バッファーの処理
オーディオ再生の制御フロー

以下のスクリプトで関連するコードを確認します：

✅ Verification successful

buffer を null に設定する処理は適切です

コードベースの分析結果から、以下の点が確認できました：

speakQueue は null バッファーを適切に処理し、エラーハンドリングも実装されています
audioMode が有効な場合に buffer を null に設定することは、音声合成をスキップする意図的な設計です
他の音声合成モード（VOICEVOX、OpenAI、Azure等）と同様の条件分岐構造で実装されています

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: バッファー処理とオーディオキューの実装を確認

# speakQueueでのnullバッファー処理の確認
rg -A 10 "addTask" src/features/messages/speakQueue.ts

# オーディオモード関連の設定を確認
rg "audioMode" src/features/stores/settings.ts

Length of output: 709

Script:

#!/bin/bash
# バッファー処理の詳細な確認

# speakQueueでのタスク処理の実装を確認
rg -A 15 "while \(this\.queue\.length > 0\)" src/features/messages/speakQueue.ts

# speakCharacter.tsでのバッファー処理全体を確認
rg -A 20 "buffer =" src/features/messages/speakCharacter.ts

# オーディオモード関連の処理フローを確認
ast-grep --pattern 'if ($_.audioMode) { $$$ }'

Length of output: 9062

src/features/youtube/youtubeComments.ts (2)

13-13: メッセージ処理の集中化による改善

メッセージ処理をmessageSelectorsに集中させることで、コードの保守性と再利用性が向上しています。この変更は適切なアーキテクチャの改善です。

98-99: メッセージ取得方法の改善

homeStoreから直接アクセスする代わりにmessageSelectorsを使用することで、データアクセスの一貫性が確保されています。

locales/zh/translation.json (1)

146-146: 翻訳の一貫性と文字セットの使用について確認が必要です

以下の点について確認をお願いします：

「音頻」という表現は簡体字中国語での表記ですが、このファイルの他の箇所では繁体字中国語が使用されています。一貫性を保つため、繁体字の「音頻」または「音频」の使用を検討してください。
他の言語ファイル（英語、日本語、韓国語）との機能名の一貫性を確保するため、「Beta」の表記も「β版」に統一することを推奨します。

以下の修正を提案します：

-  "AudioMode": "音頻模式（Beta）",
+  "AudioMode": "音頻模式（β版）",

locales/ko/translation.json (1)

146-146: オーディオモード関連の翻訳エントリが適切に追加されています！

韓国語のローカライズファイルに新しく追加された「オーディオモード (Beta)」の翻訳が、他の言語（英語、日本語、中国語）のローカライズファイルと一貫性を保ちながら、適切に実装されています。

src/components/realtimeAPIUtils.tsx (2)

6-9: オーディオ処理の依存関係が適切に更新されています

AudioBufferManagerのインポートとbase64ToArrayBufferの移動により、オーディオ処理のロジックが適切に集約されています。これは保守性の向上につながります。

110-110: 型の更新が適切に行われています

accumulatedAudioDataRefの型がAudioBufferManagerに変更されており、新しいオーディオ処理の実装と整合性が取れています。

src/components/menu.tsx (2)

242-242: ChatLogコンポーネントのレンダリングが簡素化されました

webSocketModeの条件チェックが削除され、showChatLogの状態のみでレンダリングが制御されるようになりました。これは適切な変更ですが、ChatLogコンポーネント内でwebSocketMode関連の処理が適切に処理されているか確認が必要です。

以下のスクリプトでChatLogコンポーネントの実装を確認できます：

✅ Verification successful

Let me check the usage of webSocketMode in the codebase to ensure there are no missing dependencies.

ChatLogコンポーネントからwebSocketMode条件の削除は適切です

ChatLogコンポーネントの実装を確認したところ、WebSocket関連の処理は完全に分離されており、useWebSocketフックとhandlers.tsで適切に管理されています。ChatLogコンポーネントはメッセージの表示に特化しており、WebSocketのモード制御に依存する必要はありません。この変更は、関心の分離の原則に従った適切なリファクタリングです。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# ChatLogコンポーネントでのwebSocketModeの使用状況を確認
ast-grep --pattern 'const ChatLog = () => {
  $$$
}'

Length of output: 3291

Script:

#!/bin/bash
# Check for webSocketMode usage in the codebase
rg "webSocketMode" -A 2

Length of output: 3671

160-162: チャットログボタンの無効化条件が削除されています

チャットログが空の場合でもボタンが有効になるため、ユーザーエクスペリエンスに影響を与える可能性があります。空のチャットログを表示することに意味があるか確認してください。

以下のスクリプトで、チャットログの使用状況を確認できます：

✅ Verification successful

Let me gather more information about the chat log state management.

チャットログボタンの無効化条件の変更は適切です

チャットログの状態管理を確認したところ、以下の理由により変更は問題ありません：

チャットログは常にArrayとして初期化され、undefinedになることはありません
ユーザーは常にチャットの履歴を確認できる必要があります（空の状態も含めて）
ChatLogコンポーネントは空の配列を適切に処理できる実装になっています

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# チャットログの使用状況を確認
rg -A 5 "showChatLog|ChatLog" src/

Length of output: 3291

Script:

#!/bin/bash
# チャットログの状態管理を確認
rg -A 5 "chatLog" src/store/

Length of output: 79

Script:

#!/bin/bash
# ストア関連のファイルを検索
fd store

Length of output: 57

Script:

#!/bin/bash
# チャットログの状態管理を確認
rg -A 5 "chatLog" src/features/stores/

Length of output: 1024

locales/en/translation.json (1)

146-146: 翻訳の追加を承認します！

新しく追加された "Audio Mode (Beta)" の翻訳エントリは、既存の命名規則に従っており、適切に配置されています。

他の言語ファイルでの翻訳の一貫性を確認するため、以下のスクリプトを実行します：

✅ Verification successful

翻訳の一貫性が確認できました！

すべての言語ファイル（英語、日本語、韓国語、中国語）で "AudioMode" の翻訳が適切に実装されており、以下の点で一貫性が保たれています：

すべての言語で Beta 表記が含まれている
各言語の慣習に従った適切な表記が使用されている（括弧の種類など）
意味が正確に伝わる翻訳がなされている

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# 全ての言語ファイルで "AudioMode" キーの存在と値を確認
for lang in en ja ko zh; do
  echo "Checking $lang translation:"
  jq '.AudioMode' "locales/$lang/translation.json"
done

Length of output: 581

src/features/stores/settings.ts (3)

14-14: 型定義の追加が適切に行われています

オーディオモード関連の設定がGeneralインターフェースに適切に追加されており、型の整合性が保たれています。

Also applies to: 99-101

286-288: 状態の永続化が適切に実装されています

partialize関数内でオーディオモード関連の設定が適切に永続化対象として追加されています。これにより、ユーザーの設定が正しく保存・復元されます。

224-230: 環境変数からの初期値設定が適切に実装されています

オーディオモード関連の3つの設定について、環境変数から適切にデフォルト値が設定されています：

audioMode: NEXT_PUBLIC_AUDIO_MODEから設定
audioModeInputType: NEXT_PUBLIC_AUDIO_MODE_INPUT_TYPEから設定（デフォルト: 'input_text'）
audioModeVoice: NEXT_PUBLIC_AUDIO_MODE_VOICEから設定（デフォルト: 'shimmer'）

環境変数の存在を確認するスクリプトを実行します：

src/components/messageInputContainer.tsx (3)

254-260: メッセージ送信の実装が適切です！

コールバック関数の最適化が行われており、空のメッセージを防ぐバリデーションも実装されています。

Line range hint 262-267: 入力ハンドラーの実装が適切です！

useCallbackの依存配列が空であり、イベントの型定義も適切に行われています。

Line range hint 269-276: MessageInputコンポーネントの型定義の更新確認が必要です

新しく追加されたprops（onChangeUserMessageとonClickSendButton）がMessageInputコンポーネントの型定義に正しく追加されているか確認してください。

以下のスクリプトで型定義を確認できます：

✅ Verification successful

型定義の確認が完了しました - 問題ありません

src/components/messageInput.tsxの型定義を確認したところ、新しく追加されたprops（onChangeUserMessageとonClickSendButton）が正しく定義されていることが確認できました。以下の通り、必要な型定義がすべて含まれています：

onChangeUserMessage: (event: React.ChangeEvent<HTMLInputElement | HTMLTextAreaElement>) => void
onClickSendButton: (event: React.MouseEvent<HTMLButtonElement>) => void

型の整合性が取れており、追加の修正は必要ありません。

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# MessageInputコンポーネントの型定義を確認
ast-grep --pattern 'type $Props = {
  $$$
  onChangeUserMessage?: $_
  onClickSendButton?: $_
  $$$
}'

Length of output: 104

Script:

#!/bin/bash
# MessageInputコンポーネントの型定義を検索（より広範な検索）
ast-grep --pattern 'type Props = {
  $$$
}'

# バックアップとして、ファイル内容を直接確認
rg -A 10 "type Props" src/components/MessageInput

Length of output: 1146

src/components/settings/modelProvider.tsx (4)

12-13: 型のインポートが適切に追加されています。

AudioModeInputTypeとOpenAITTSVoiceの型が正しくインポートされており、音声モード機能の型安全性が確保されています。

28-30: 音声モード関連のステート変数が適切に実装されています。

audioMode、audioModeInputType、audioModeVoiceの3つのステート変数がsettingsStoreから正しく取得されています。

101-112: モード切り替えの相互排他制御が実装されています。

リアルタイムAPIモードと音声モードが同時にオンにならないように適切に制御されています。

293-293: 条件分岐の実装が適切です。

!realtimeAPIMode && !audioModeの条件チェックにより、音声モードがオンの場合にモデル選択UIが非表示になる実装が正しく行われています。

src/features/chat/handlers.ts (5)

9-9: messageSelectorsを正しくインポートしています

messageSelectorsモジュールのインポートが適切に行われています。

149-149: getAIChatResponseStreamの非同期呼び出しを確認しました

getAIChatResponseStream関数をawaitで正しく非同期呼び出ししており、エラーハンドリングも適切です。

311-311: aiTextLogのメッセージ処理が改善されています

aiTextLogをmessageSelectors.getProcessedMessagesで処理することで、メッセージの整形が行われ、コードの可読性と保守性が向上しています。

435-435: messageLogの処理方法が改善されています

messageSelectors.getProcessedMessagesを使用してmessageLogを処理することで、一貫性のあるメッセージデータが生成されます。

552-553: ss.realtimeAPIModeとss.audioModeが同時に有効な場合の挙動を確認してください

現在の条件分岐では、ss.realtimeAPIModeがtrueの場合、ss.audioModeがtrueでもss.realtimeAPIModeのブロックのみが実行されます。両方が同時にtrueになる可能性がある場合、期待する動作になるか確認し、必要であればロジックを修正してください。

coderabbitai · 2024-10-30T10:35:46Z

src/features/constants/settings.ts


 export type RealtimeAPIModeContentType = 'input_text' | 'input_audio'
 export type RealtimeAPIModeVoice = 'alloy' | 'echo' | 'shimmer'
+export type AudioModeInputType = 'input_text' | 'input_audio'


⚠️ Potential issue

コードの重複が見られます

AudioModeInputType は既存の RealtimeAPIModeContentType と同じ値を持つユニオン型として定義されています。コードの保守性を向上させるため、以下のいずれかの対応を検討してください：

既存の型を再利用する

両方の型が必要な理由を明確にするためにコメントを追加する

重複を解消する場合は、以下の修正を提案します：

-export type AudioModeInputType = 'input_text' | 'input_audio' +export type AudioModeInputType = RealtimeAPIModeContentType

Committable suggestion was skipped due to low confidence.

coderabbitai · 2024-10-30T10:35:46Z

src/components/settings/log.tsx

+  const chatLog = messageSelectors.getTextAndImageMessages(
+    homeStore((s) => s.chatLog)
+  )

-  const newChatLog = hs.chatLog.map((m, i) => {
+  const newChatLog = chatLog.map((m, i) => {


🛠️ Refactor suggestion

エラーハンドリングの追加を推奨します

メッセージの更新処理に対してエラーハンドリングが実装されていません。予期せぬエラーが発生した場合のユーザー体験を改善するため、try-catch文の追加を検討してください。

以下のような実装を提案します：

const handleChangeChatLog = (targetIndex: number, text: string) => { + try { const chatLog = messageSelectors.getTextAndImageMessages( homeStore((s) => s.chatLog) ) const newChatLog = chatLog.map((m, i) => { return i === targetIndex ? { role: m.role, content: text } : m }) homeStore.setState({ chatLog: newChatLog }) + } catch (error) { + console.error('Failed to update chat log:', error) + // TODO: ユーザーへのエラー通知を実装 + } }

Committable suggestion was skipped due to low confidence.

coderabbitai · 2024-10-30T10:35:47Z

src/features/messages/messageSelectors.ts

+  getTextAndImageMessages: (messages: Message[]): Message[] => {
+    return messages
+      .filter(
+        (message) => message.content !== null && message.content !== undefined
+      )
+      .filter((message) => {
+        if (
+          typeof message.content === 'string' ||
+          Array.isArray(message.content)
+        ) {
+          return true
+        }
+        return false
+      })
+  },


🛠️ Refactor suggestion

型安全性とコードの可読性を改善することを提案します

現在の実装は機能していますが、以下の改善が可能です：

ネストされたfilterを1つにまとめる

型チェックをより明示的にする

以下のようなリファクタリングを提案します：

getTextAndImageMessages: (messages: Message[]): Message[] => { - return messages - .filter( - (message) => message.content !== null && message.content !== undefined - ) - .filter((message) => { - if ( - typeof message.content === 'string' || - Array.isArray(message.content) - ) { - return true - } - return false - }) + return messages.filter((message): boolean => { + if (!message.content) return false; + return typeof message.content === 'string' || Array.isArray(message.content); + }); },

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

getTextAndImageMessages: (messages: Message[]): Message[] => {

return messages

.filter(

(message) => message.content !== null && message.content !== undefined

)

.filter((message) => {

if (

typeof message.content === 'string' ||

Array.isArray(message.content)

) {

return true

}

return false

})

},

getTextAndImageMessages: (messages: Message[]): Message[] => {

return messages.filter((message): boolean => {

if (!message.content) return false;

return typeof message.content === 'string' || Array.isArray(message.content);

});

},

coderabbitai · 2024-10-30T10:35:47Z

src/features/messages/messageSelectors.ts

+  normalizeMessages: (messages: Message[]): Message[] => {
+    let lastImageUrl = ''
+    return messages
+      .reduce((acc: Message[], item: Message) => {
+        if (
+          item.content &&
+          typeof item.content != 'string' &&
+          item.content[0] &&
+          item.content[1]
+        ) {
+          lastImageUrl = item.content[1].image
+        }
+
+        const lastItem = acc[acc.length - 1]
+        if (lastItem && lastItem.role === item.role) {
+          if (typeof item.content != 'string' && item.content) {
+            lastItem.content += ' ' + item.content[0].text
+          } else {
+            lastItem.content += ' ' + item.content
+          }
+        } else {
+          const text = item.content
+            ? typeof item.content != 'string'
+              ? item.content[0].text
+              : item.content
+            : ''
+          if (lastImageUrl != '') {
+            acc.push({
+              ...item,
+              content: [
+                { type: 'text', text: text.trim() },
+                { type: 'image', image: lastImageUrl },
+              ],
+            })
+            lastImageUrl = ''
+          } else {
+            acc.push({ ...item, content: text.trim() })
+          }
+        }
+        return acc
+      }, [])
+      .filter((item) => item.content !== '')
+  },


⚠️ Potential issue

複雑な条件分岐の簡素化と型安全性の向上が必要です

現在の実装には以下の課題があります：

複雑なネストされた条件分岐

型安全性の問題

ミュータブルな状態管理（lastImageUrl）

以下のようなリファクタリングを提案します：

+ interface ImageContent { + type: 'image'; + image: string; + } + interface TextContent { + type: 'text'; + text: string; + } + const isImageContent = (content: unknown): content is (TextContent | ImageContent)[] => + Array.isArray(content) && content.length >= 2; normalizeMessages: (messages: Message[]): Message[] => { - let lastImageUrl = '' + const state = { + lastImageUrl: '', + }; return messages .reduce((acc: Message[], item: Message) => { - if ( - item.content && - typeof item.content != 'string' && - item.content[0] && - item.content[1] - ) { - lastImageUrl = item.content[1].image + if (isImageContent(item.content)) { + state.lastImageUrl = item.content[1].image; } // ... rest of the implementation }, [])

Committable suggestion was skipped due to low confidence.

coderabbitai · 2024-10-30T10:35:47Z

src/features/messages/messageSelectors.ts

+  getAudioMessages: (messages: Message[]): Message[] => {
+    return messages.filter((message) => {
+      // userの場合：contentがstring型のメッセージのみを許可
+      if (message.role === 'user') {
+        return typeof message.content === 'string'
+      }
+      // assistantの場合：audioプロパティを持つメッセージのみを許可
+      if (message.role === 'assistant') {
+        return message.audio !== undefined
+      }
+      // その他のroleは除外
+      return false
+    })
+  },


🛠️ Refactor suggestion

型の安全性と保守性を向上させる改善を提案します

以下の点について改善の余地があります：

型ガードの導入

ロール定数の定義

以下のような改善を提案します：

+ const VALID_ROLES = { + USER: 'user' as const, + ASSISTANT: 'assistant' as const, + }; getAudioMessages: (messages: Message[]): Message[] => { return messages.filter((message) => { - if (message.role === 'user') { + if (message.role === VALID_ROLES.USER) { return typeof message.content === 'string' } - if (message.role === 'assistant') { + if (message.role === VALID_ROLES.ASSISTANT) { return message.audio !== undefined } return false }) },

Committable suggestion was skipped due to low confidence.

coderabbitai · 2024-10-30T10:35:48Z

src/features/chat/openAIAudioChat.ts

+  const openai = new OpenAI({
+    apiKey: ss.openaiKey,
+    dangerouslyAllowBrowser: true,
+  })


⚠️ Potential issue

セキュリティ上の懸念: 'dangerouslyAllowBrowser' の使用について

dangerouslyAllowBrowser: true を設定すると、APIキーがクライアントサイドで露出する可能性があり、セキュリティリスクとなります。サーバーサイドでAPIリクエストを処理するか、他の安全な方法を検討してください。

coderabbitai · 2024-10-30T10:35:48Z

src/features/chat/openAIAudioChat.ts

+        })
+
+        for await (const chunk of response) {
+          const audio = (chunk.choices[0]?.delta as any)?.audio


🛠️ Refactor suggestion

型の安全性を向上させるために 'as any' の使用を避ける

chunk.choices[0]?.delta に対して as any を使用していますが、これはTypeScriptの型チェックを回避してしまいます。適切な型定義を追加して、型の安全性を確保しましょう。

必要であれば、適切な型定義の作成をお手伝いできます。ご希望であればお知らせください。

coderabbitai · 2024-10-30T10:35:48Z

src/features/chat/openAIAudioChat.ts

+        for await (const chunk of response) {
+          const audio = (chunk.choices[0]?.delta as any)?.audio
+          if (audio) {
+            if (audio.transcript) {
+              controller.enqueue(audio.transcript)
+            }
+            if (audio.data) {
+              bufferManager.addData(base64ToArrayBuffer(audio.data))
+            }
+            if (audio.id) {
+              hs.chatLog.push({
+                role: 'assistant',
+                audio: { id: audio.id },
+              })
+            }
+          }
+        }


🛠️ Refactor suggestion

ループ内のエラーハンドリングを強化する

for await ループ内での処理中に、controller.enqueue や bufferManager.addData がエラーをスローする可能性があります。これらのエラーを適切にキャッチし、ストリームを安全に終了するために、ループ内にエラーハンドリングを追加することを検討してください。

以下のように、ループ内に try-catch ブロックを追加できます。

for await (const chunk of response) { + try { const audio = (chunk.choices[0]?.delta as any)?.audio if (audio) { if (audio.transcript) { controller.enqueue(audio.transcript) } if (audio.data) { bufferManager.addData(base64ToArrayBuffer(audio.data)) } if (audio.id) { hs.chatLog.push({ role: 'assistant', audio: { id: audio.id }, }) } } + } catch (err) { + console.error('チャンク処理中のエラー:', err) + controller.error(err) + break + } }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

for await (const chunk of response) {

const audio = (chunk.choices[0]?.delta as any)?.audio

if (audio) {

if (audio.transcript) {

controller.enqueue(audio.transcript)

}

if (audio.data) {

bufferManager.addData(base64ToArrayBuffer(audio.data))

}

if (audio.id) {

hs.chatLog.push({

role: 'assistant',

audio: { id: audio.id },

})

}

}

}

for await (const chunk of response) {

try {

const audio = (chunk.choices[0]?.delta as any)?.audio

if (audio) {

if (audio.transcript) {

controller.enqueue(audio.transcript)

}

if (audio.data) {

bufferManager.addData(base64ToArrayBuffer(audio.data))

}

if (audio.id) {

hs.chatLog.push({

role: 'assistant',

audio: { id: audio.id },

})

}

}

} catch (err) {

console.error('チャンク処理中のエラー:', err)

controller.error(err)

break

}

}

coderabbitai · 2024-10-30T10:35:48Z

src/utils/audioBufferManager.ts

+    if (this.buffer.byteLength > 0) {
+      const bufferToSend = this.buffer
+      this.buffer = new ArrayBuffer(0)
+      await this.sendCallback(bufferToSend)
+    }


🛠️ Refactor suggestion

⚠️ Potential issue

sendCallback のエラーハンドリングを追加してください

sendBuffer メソッド内で await this.sendCallback(bufferToSend) を呼び出していますが、sendCallback がエラーをスローした場合の処理がありません。エラーが発生した場合の対策を追加することをお勧めします。

修正案：

async sendBuffer(): Promise<void> { if (this.buffer.byteLength > 0) { const bufferToSend = this.buffer this.buffer = new ArrayBuffer(0) try { await this.sendCallback(bufferToSend) } catch (error) { console.error('バッファの送信に失敗しました:', error) // 必要に応じて再試行やエラーハンドリングを追加してください } } }

coderabbitai · 2024-10-30T10:35:48Z

src/utils/audioBufferManager.ts

+  addData(newData: ArrayBuffer): void {
+    this.buffer = this.mergeArrayBuffers(this.buffer, newData)
+    if (this.buffer.byteLength >= this.BUFFER_THRESHOLD) {
+      this.sendBuffer()
+    }
+  }


🛠️ Refactor suggestion

⚠️ Potential issue

sendBuffer() を呼び出す際の非同期処理を考慮してください

addData メソッド内で sendBuffer() は非同期関数ですが、await せずに呼び出されています。これにより、sendBuffer() の完了を待たずに次の処理が進むため、バッファのリセットが正しく行われない可能性があります。addData メソッドを async にし、await this.sendBuffer() とするか、sendBuffer() の戻り値を適切に処理してください。

修正案：

- addData(newData: ArrayBuffer): void { + async addData(newData: ArrayBuffer): Promise<void> { this.buffer = this.mergeArrayBuffers(this.buffer, newData) if (this.buffer.byteLength >= this.BUFFER_THRESHOLD) { - this.sendBuffer() + await this.sendBuffer() } }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

addData(newData: ArrayBuffer): void {

this.buffer = this.mergeArrayBuffers(this.buffer, newData)

if (this.buffer.byteLength >= this.BUFFER_THRESHOLD) {

this.sendBuffer()

}

}

async addData(newData: ArrayBuffer): Promise<void> {

this.buffer = this.mergeArrayBuffers(this.buffer, newData)

if (this.buffer.byteLength >= this.BUFFER_THRESHOLD) {

await this.sendBuffer()

}

}

…ons-api-audio-input OpenAI chat completions api のオーディオモード追加

tegnike added 10 commits October 28, 2024 15:59

Chat Completions APIのaudioモードを利用してtext outputで発話できるまで

c64ed27

音声を再生できるまで（通常のフローと二重で音声が流れる問題あり）

c51255f

エラー修正

3269401

chatLogのフィルター関数追加

b85959f

Chat Completions APIのaudioモードで会話できるまで

a27c7eb

不要なAPI呼び出しを辞める

7f50702

型エラー修正

df4cb7c

型エラー修正

e87ed1c

型エラー修正

759c48a

localesファイル更新

4d67972

coderabbitai bot reviewed Oct 30, 2024

View reviewed changes

tegnike merged commit 66681a8 into develop Oct 30, 2024

tegnike deleted the feature/add-chat-completions-api-audio-input branch October 30, 2024 10:54

coderabbitai bot mentioned this pull request Apr 1, 2025

Develop #327

Merged

coderabbitai bot mentioned this pull request Apr 23, 2025

本番リリース #364

Closed

terisuke pushed a commit to terisuke/aituber-kit that referenced this pull request Dec 5, 2025

Merge pull request tegnike#213 from tegnike/feature/add-chat-completi…

9ec5039

…ons-api-audio-input OpenAI chat completions api のオーディオモード追加

Uh oh!

OpenAI chat completions api のオーディオモード追加 #213

OpenAI chat completions api のオーディオモード追加 #213

Uh oh!

Conversation

tegnike commented Oct 30, 2024 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

vercel bot commented Oct 30, 2024

Uh oh!

coderabbitai bot commented Oct 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Possibly related PRs

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 30, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tegnike commented Oct 30, 2024 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Oct 30, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)