To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???松??鍮??妖??妖??仰??異?? 001111110011111100111111100011111011110000111111001111111110100001001010001111110011111110010111011001000011111100111111100101110110010000111111001111111000101111000010001111110011111110001000110110010011111100111111 3f3f3f8fbc3f3fe84a3f3f97643f3f97643f3f8bc23f3f88d93f3f
EUC-JP ???松??鍮??妖??妖??仰??異?? 001111110011111100111111101111101011111000111111001111111110111110101011001111110011111111001101110001010011111100111111110011011100010100111111001111111011011011000100001111110011111110110000110110110011111100111111 3f3f3fbebe3f3fefab3f3fcdc53f3fcdc53f3fb6c43f3fb0db3f3f
UTF-8 琉쀭슀松싳떤鍮낅뿊妖껋븪妖껋빏仰띰퐥異뀀퉭 111011111010011110001100111011001000000010101101111011001000101010000000111001101001110110111110111011001000101110110011111010111001011010100100111010011000110110101110111010111000001010000101111010111011111110001010111001011010011010010110111010101011101110001011111010111011100010101010111001011010011010010110111010101011101110001011111010111011100110001111111001001011101110110000111010111001110110110000111011011001000010100101111001111001010110110000111010111000000010000000111011011000100110101101 efa78cec80adec8a80e69dbeec8bb3eb96a4e98daeeb8285ebbf8ae5a696eabb8bebb8aae5a696eabb8bebb98fe4bbb0eb9db0ed90a5e795b0eb8080ed89ad
UHC 琉쀭슀松싳떤鍮낅뿊妖껋븪妖껋빏仰띰퐥異뀀퉭 111010111010010010010111111011011001101010010011111000011110011010011010111011001011011010110010111010111011100110000101111010111001011110010001111010001110110110000011111011001001010110010011111010001110110110000011111011001001010110110011111001001110011010110110111011111011110110001110111011001011011010110010111010111011100110000101 eba497ed9a93e1e69aecb6b2ebb985eb9791e8ed83ec9593e8ed83ec95b3e4e6b6efbd8eecb6b2ebb985

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)