To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?蒡??驚漿?坎宏?蒡??驚漿?坎槐^ 0011111111100100111011100011111100111111100010111100000110011111111101110011111110011010101010101000110101000111001111111110010011101110001111110011111110001011110000011001111111110111001111111001101010101010100111101100010101011110 3fe4ee3f3f8bc19ff73f9aaa8d473fe4ee3f3f8bc19ff73f9aaa9ec55e
EUC-JP ?蒡??驚漿?坎宏?蒡??驚漿?坎槐^ 0011111111101000111100000011111100111111101101101100001111011110111110010011111111010100101011001011100110101000001111111110100011110000001111110011111110110110110000111101111011111001001111111101010010101100110111001100011101011110 3fe8f03f3fb6c3def93fd4acb9a83fe8f03f3fb6c3def93fd4acdcc75e
UTF-8 뤾蒡놈퓥驚漿쥙坎宏뤾蒡놈퓥驚漿쥙坎槐^ 11101011101001001011111011101000100100101010000111101011100001101000100011101101100100111010010111101001101010011001101011100110101111001011111111101100101001011001100111100101100111011000111011100101101011101000111111101011101001001011111011101000100100101010000111101011100001101000100011101101100100111010010111101001101010011001101011100110101111001011111111101100101001011001100111100101100111011000111011100110101001111001000001011110 eba4bee892a1eb8688ed93a5e9a99ae6bcbfeca599e59d8ee5ae8feba4bee892a1eb8688ed93a5e9a99ae6bcbfeca599e59d8ee6a7905e
UHC 뤾蒡놈퓥驚漿쥙坎宏뤾蒡놈퓥驚漿쥙坎槐^ 10001111111010101101101110111100101100111111000010111111100011101100110011110011111011011110110010100010100011101100101011101100110011101101101110001111111010101101101110111100101100111111000010111111100011101100110011110011111011011110110010100010100011101100101011101100110011101101100101011110 8feadbbcb3f0bf8eccf3edeca28ecaeccedb8feadbbcb3f0bf8eccf3edeca28ecaecced95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)