What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?蒡??驚漿?坎去?蒡??驚漿?坎醵^ 0011111111100100111011100011111100111111100010111100000110011111111101110011111110011010101010101000101110001110001111111110010011101110001111110011111110001011110000011001111111110111001111111001101010101010111001111101000101011110 3fe4ee3f3f8bc19ff73f9aaa8b8e3fe4ee3f3f8bc19ff73f9aaae7d15e
EUC-JP ?蒡??驚漿?坎去?蒡??驚漿?坎醵^ 0011111111101000111100000011111100111111101101101100001111011110111110010011111111010100101011001011010111101110001111111110100011110000001111110011111110110110110000111101111011111001001111111101010010101100111011101101001101011110 3fe8f03f3fb6c3def93fd4acb5ee3fe8f03f3fb6c3def93fd4aceed35e
UTF-8 뤾蒡놈퓥驚漿쥙坎去뤾蒡놈퓥驚漿쥙坎醵^ 11101011101001001011111011101000100100101010000111101011100001101000100011101101100100111010010111101001101010011001101011100110101111001011111111101100101001011001100111100101100111011000111011100101100011101011101111101011101001001011111011101000100100101010000111101011100001101000100011101101100100111010010111101001101010011001101011100110101111001011111111101100101001011001100111100101100111011000111011101001100001101011010101011110 eba4bee892a1eb8688ed93a5e9a99ae6bcbfeca599e59d8ee58ebbeba4bee892a1eb8688ed93a5e9a99ae6bcbfeca599e59d8ee986b55e
UHC 뤾蒡놈퓥驚漿쥙坎去뤾蒡놈퓥驚漿쥙坎醵^ 10001111111010101101101110111100101100111111000010111111100011101100110011110011111011011110110010100010100011101100101011101100110010111101101110001111111010101101101110111100101100111111000010111111100011101100110011110011111011011110110010100010100011101100101011101100110010111101100101011110 8feadbbcb3f0bf8eccf3edeca28ecaeccbdb8feadbbcb3f0bf8eccf3edeca28ecaeccbd95e
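
The conversion in the table above can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the function name `encode_table` is hypothetical, the codec names are Python's own (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC), and replacing unencodable characters with `?` (0x3f) is an assumption that happens to match the `3f` bytes shown in the table.

```python
# Hypothetical helper: encode a string in several charsets and return,
# for each one, the bytes as a binary bit string and as a hex string.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    rows = []
    for name in charsets:
        # Assumption: characters the charset cannot represent become "?" (0x3f).
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("去^"):
        print(name, bits, hexa)
```

Each row holds the charset name, the binary bit string, and the hexadecimal string, in the same column order as the table.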

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).