To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 肢私????淫?肢私????淫?B 1000111010001000100011101000010000111111001111110011111100111111100010001111101000111111100011101000100010001110100001000011111100111111001111110011111110001000111110100011111101000010 8e888e843f3f3f3f88fa3f8e888e843f3f3f3f88fa3f42
EUC-JP 肢私????淫?肢私????淫?B 1011101111101000101110111110010000111111001111110011111100111111101100001111110000111111101110111110100010111011111001000011111100111111001111110011111110110000111111000011111101000010 bbe8bbe43f3f3f3fb0fc3fbbe8bbe43f3f3f3fb0fc3f42
UTF-8 肢私렎렠裏렦淫괌肢私렎렠裏렦淫괌B 11101000100000101010001011100111101001111000000111101011101000001000111011101011101000001010000011101111101001111010011111101011101000001010011011100110101101111010101111101010101101001000110011101000100000101010001011100111101001111000000111101011101000001000111011101011101000001010000011101111101001111010011111101011101000001010011011100110101101111010101111101010101101001000110001000010 e882a2e7a781eba08eeba0a0efa7a7eba0a6e6b7abeab48ce882a2e7a781eba08eeba0a0efa7a7eba0a6e6b7abeab48c42
UHC 肢私렎렠裏렦淫괌肢私렎렠裏렦淫괌B 111100101011011011011110111001111000111010100100100011101011000111101100110000001000111010110101111010111110001010110001101000011111001010110110110111101110011110001110101001001000111010110001111011001100000010001110101101011110101111100010101100011010000101000010 f2b6dee78ea48eb1ecc08eb5ebe2b1a1f2b6dee78ea48eb1ecc08eb5ebe2b1a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)