To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遙??節??言??語↓ゴ偃??躍??歪??^ 11101010101000010011111100111111100100001101111100111111001111111000110010111110001111110011111110001100111010101000000110101011100000110101001110011000111011100011111100111111100101101111010000111111001111111001100001100011001111110011111101011110 eaa13f3f90df3f3f8cbe3f3f8cea81ab835398ee3f3f96f43f3f98633f3f5e
EUC-JP 遙??節??言??語↓ゴ偃??躍??歪??^ 11110100101000110011111100111111110000001110000100111111001111111011100011000000001111110011111110111000111011001010001010101101101001011011010011010000111100000011111100111111110011001111011000111111001111111100111111000100001111110011111101011110 f4a33f3fc0e13f3fb8c03f3fb8eca2ada5b4d0f03f3fccf63f3fcfc43f3f5e
UTF-8 遙닸돇節얍툦言꿴궓語↓ゴ偃섓슭躍앮뜴歪ⓩ궠^ 11101001100000011001100111101011100010111011100011101011100011111000011111100111101011111000000011101100100101101000110111101101100010001010011011101000101010001000000011101010101111111011010011101010101101101001001111101000101010101001111011100010100001101001001111100011100000101011010011100101100000011000001111101100100001001001001111101100100010101010110111101000101110101000110111101100100101011010111011101011100111001011010011100110101011011010101011100010100100111010100111101010101101101010000001011110 e98199eb8bb8eb8f87e7af80ec968ded88a6e8a880eabfb4eab693e8aa9ee28693e382b4e58183ec8493ec8aade8ba8dec95aeeb9cb4e6adaae293a9eab6a05e
UHC 遙닸돇節얍툦言꿴궓語↓ゴ偃섓슭躍앮뜴歪ⓩ궠^ 11101001101010111011010011100110100010011001100011101111101111011011111011100101101110001001110111100101111010111011001011101001100000101010100011100101110111101010000111101001101010111011010011100101111001111001100011101111101111011011111011100101101110001001110111100110100011011011001011101000111000001010100011100110100000101011001101011110 e9abb4e68998efbdbee5b89de5ebb2e982a8e5dea1e9abb4e5e798efbdbee5b89de68db2e8e0a8e682b35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)