What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 咳??楷??楷??[咳??楷??楷??[^ 100010100101000000111111001111111001111010110010001111110011111110011110101100100011111100111111010110111000101001010000001111110011111110011110101100100011111100111111100111101011001000111111001111110101101101011110 8a503f3f9eb23f3f9eb23f3f5b8a503f3f9eb23f3f9eb23f3f5b5e
EUC-JP 咳??楷??楷??[咳??楷??楷??[^ 101100111011000100111111001111111101110010110100001111110011111111011100101101000011111100111111010110111011001110110001001111110011111111011100101101000011111100111111110111001011010000111111001111110101101101011110 b3b13f3fdcb43f3fdcb43f3f5bb3b13f3fdcb43f3fdcb43f3f5b5e
UTF-8 咳띌맛楷쇤눼楷곈짹[咳띌맛楷쇤눼楷곈짹[^ 111001011001001010110011111010111001110110001100111010111010011110011011111001101010010110110111111011001000011110100100111010111000100010111100111001101010010110110111111010101011001110001000111011001010011110111001010110111110010110010010101100111110101110011101100011001110101110100111100110111110011010100101101101111110110010000111101001001110101110001000101111001110011010100101101101111110101010110011100010001110110010100111101110010101101101011110 e592b3eb9d8ceba79be6a5b7ec87a4eb88bce6a5b7eab388eca7b95be592b3eb9d8ceba79be6a5b7ec87a4eb88bce6a5b7eab388eca7b95b5e
UHC 咳띌맛楷쇤눼楷곈짹[咳띌맛楷쇤눼楷곈짹[^ 111110101010011010110110111010011011100011000000111110101010110010111100111010011011010010110100111110101010110010110000111010011100001010110001010110111111101010100110101101101110100110111000110000001111101010101100101111001110100110110100101101001111101010101100101100001110100111000010101100010101101101011110 faa6b6e9b8c0faacbce9b4b4faacb0e9c2b15bfaa6b6e9b8c0faacbce9b4b4faacb0e9c2b15b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
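
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_bits` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {name: encode_bits(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("咳").items():
        print(name, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.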