To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?謔孩?孩?孩?去?謔孩?孩?孩?醵^ 0011111111100110100000101001101101110111001111111001101101110111001111111001101101110111001111111000101110001110001111111110011010000010100110110111011100111111100110110111011100111111100110110111011100111111111001111101000101011110 3fe6829b773f9b773f9b773f8b8e3fe6829b773f9b773f9b773fe7d15e
EUC-JP ?謔孩?孩?孩?去?謔孩?孩?孩?醵^ 0011111111101011111000101101010111011000001111111101010111011000001111111101010111011000001111111011010111101110001111111110101111100010110101011101100000111111110101011101100000111111110101011101100000111111111011101101001101011110 3febe2d5d83fd5d83fd5d83fb5ee3febe2d5d83fd5d83fd5d83feed35e
UTF-8 뤋謔孩귑孩꾜孩㈈去뤋謔孩귑孩꾜孩㈈醵^ 11101011101001001000101111101000101011001001010011100101101011011010100111101010101101111001000111100101101011011010100111101010101111101001110011100101101011011010100111100011100010001000100011100101100011101011101111101011101001001000101111101000101011001001010011100101101011011010100111101010101101111001000111100101101011011010100111101010101111101001110011100101101011011010100111100011100010001000100011101001100001101011010101011110 eba48be8ac94e5ada9eab791e5ada9eabe9ce5ada9e38888e58ebbeba48be8ac94e5ada9eab791e5ada9eabe9ce5ada9e38888e986b55e
UHC 뤋謔孩귑孩꾜孩㈈去뤋謔孩귑孩꾜孩㈈醵^ 10001111101110111111100111001100111110101010100110110001110100101111101010101001101100101101100011111010101010011010100110111001110010111101101110001111101110111111100111001100111110101010100110110001110100101111101010101001101100101101100011111010101010011010100110111001110010111101100101011110 8fbbf9ccfaa9b1d2faa9b2d8faa9a9b9cbdb8fbbf9ccfaa9b1d2faa9b2d8faa9a9b9cbd95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)