To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?脹???け邯え?け??夏荊?????泌荊 001111111001001010101111001111110011111100111111100000101010111111100111101101101000001010100110001111111000001010101111001111110011111110001001110001001000110001110100001111110011111100111111001111110011111110010100111001011000110001110100 3f92af3f3f3f82afe7b682a63f82af3f3f89c48c743f3f3f3f3f94e58c74
EUC-JP ?脹???け邯え?け??夏荊?????泌荊 001111111100010010110001001111110011111100111111101001001011000111101110101110001010010010101000001111111010010010110001001111110011111110110010110001101011011111010101001111110011111100111111001111110011111111001000111001111011011111010101 3fc4b13f3f3fa4b1eeb8a4a83fa4b13f3fb2c6b7d53f3f3f3f3fc8e7b7d5
UTF-8 뤋脹쭗샘랑け邯え곽け렓뤋夏荊콒컦샅렒뤋泌荊 111010111010010010001011111010001000010010111001111011001010110110010111111011001000001110011000111010111001111010010001111000111000000110010001111010011000001010101111111000111000000110001000111010101011001110111101111000111000000110010001111010111010000010010011111010111010010010001011111001011010010010001111111010001000110110001010111011001011110110010010111011001011101110100110111011001000001110000101111010111010000010010010111010111010010010001011111001101011001110001100111010001000110110001010 eba48be884b9ecad97ec8398eb9e91e38191e982afe38188eab3bde38191eba093eba48be5a48fe88d8aecbd92ecbba6ec8385eba092eba48be6b38ce88d8a
UHC 뤋脹쭗샘랑け邯え곽け렓뤋夏荊콒컦샅렒뤋泌荊 100011111011101111110011111011001010011110001111101110111111100110110110111110111010101010110001110010101111101110101010101010001011000011111011101010101011000110001110101010001000111110111011111110011011111011111011101010101011000110001110101100001000111110111011111101001000111010100111100011111011101111111001101100101111101110101010 8fbbf3eca78fbbf9b6fbaab1cafbaaa8b0fbaab18ea88fbbf9befbaab18eb08fbbf48ea78fbbf9b2fbaa

SJIS-Win, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
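
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_row are assumptions made for illustration. The sketch also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f runs in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (0x3f); this
    replacement policy is an assumption, not a documented behavior.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "け"  # hiragana KE, one of the characters in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output line follows the table's column order: charset, character, binary bit string, hexadecimal bit string.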