To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 梧??訟??譯??節??崖??梧??節??^ 1000110011100110001111110011111110001111110101110011111100111111111001101010000100111111001111111001000011011111001111110011111110001010010100100011111100111111100011001110011000111111001111111001000011011111001111110011111101011110 8ce63f3f8fd73f3fe6a13f3f90df3f3f8a523f3f8ce63f3f90df3f3f5e
EUC-JP 梧??訟??譯??節??崖??梧??節??^ 1011100011101000001111110011111110111110110110010011111100111111111011001010001100111111001111111100000011100001001111110011111110110011101100110011111100111111101110001110100000111111001111111100000011100001001111110011111101011110 b8e83f3fbed93f3feca33f3fc0e13f3fb3b33f3fb8e83f3fc0e13f3f5e
UTF-8 梧녑뮧訟귟옾譯꾬쉼節계옾崖꿎뒄梧녔퍍節곤숴^ 11100110101000101010011111101011100001011001000111101011101011101010011111101000101010001001111111101010101101111001111111101100100110001011111011101000101011011010111111101010101111101010110011101100100010011011110011100111101011111000000011101010101100111000010011101100100110001011111011100101101101001001011011101010101111111000111011101011100100101000010011100110101000101010011111101011100001011001010011101101100011011000110111100111101011111000000011101010101100111010010011101100100010001011010001011110 e6a2a7eb8591ebaea7e8a89feab79fec98bee8adafeabeacec89bce7af80eab384ec98bee5b496eabf8eeb9284e6a2a7eb8594ed8d8de7af80eab3a4ec88b45e
UHC 梧녑뮧訟귟옾譯꾬쉼節계옾崖꿎뒄梧녔퍍節곤숴^ 11100111111111001011001111100101100100101011001011100001111010001000001011101000100111101011001111100110101110111000010011101111101111011011000011101111101111011011000011101000100111101011001111100100111100001011001011100010100010101000001011100111111111001011001111100110101110111000010011101111101111011011000011101111101111011010010001011110 e7fcb3e592b2e1e882e89eb3e6bb84efbdb0efbdb0e89eb3e4f0b2e28a82e7fcb3e6bb84efbdb0efbda45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)