To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?あ健お??い〓???????????? 00111111100000101010000010001100100100101000001010101000001111110011111110000010101000101000000110101100001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f82a08c9282a83f3f82a281ac3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ?あ健お??い〓???????????? 00111111101001001010001010110111111100101010010010101010001111110011111110100100101001001010001010101110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3fa4a2b7f2a4aa3f3fa4a4a2ae3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 룵あ健お룫킃い〓룵₃룵₃룵첂◈룵₃룵쨵◐ 111010111010001110110101111000111000000110000010111001011000000110100101111000111000000110001010111010111010001110101011111011011000001010000011111000111000000110000100111000111000000010010011111010111010001110110101111000101000001010000011111010111010001110110101111000101000001010000011111010111010001110110101111011001011001010000010111000101001011110001000111010111010001110110101111000101000001010000011111010111010001110110101111011001010100010110101111000101001011110010000 eba3b5e38182e581a5e3818aeba3abed8283e38184e38093eba3b5e28283eba3b5e28283eba3b5ecb282e29788eba3b5e28283eba3b5eca8b5e29790
UHC 룵あ健お룫킃い〓룵₃룵₃룵첂◈룵₃룵쨵◐ 10001111101010101010101010100010110010111110110110101010101010101000111110100010101101001000111110101010101001001010000111101011100011111010101010101001111111011000111110101010101010011111110110001111101010101010101010001111101000101100001010001111101010101010100111111101100011111010101010100100100011111010001011000100 8faaaaa2cbedaaaa8fa2b48faaa4a1eb8faaa9fd8faaa9fd8faaaa8fa2c28faaa9fd8faaa48fa2c4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)