What bit string is a character (or a string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 曄?????寤よ? | 100111100100000000111111001111110011111100111111001111111001101110001000100000101110011000111111 | 9e403f3f3f3f3f9b8882e63f |
| EUC-JP | 曄?????寤よ? | 110110111010000100111111001111110011111100111111001111111101010111101000101001001110100000111111 | dba13f3f3f3f3fd5e8a4e83f |
| UTF-8 | 曄쒕젺略듐룜寤よ뇥 | 111001101001101110000100111011001001001010010101111011001010000010111010111011111010010110110110111010111001001110010000111010111010001110011100111001011010111110100100111000111000001010001000111010111000011110100101 | e69b84ec9295eca0baefa5b6eb9390eba39ce5afa4e38288eb87a5 |
| UHC | 曄쒕젺略듐룜寤よ뇥 | 111001111010010110011100111010111010000010101101111001011011001010110101111000111000111110011000111001111111010110101010111010001000011110001101 | e7a59ceba0ade5b2b5e38f98e7f5aae8878d |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
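
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper name `encode_table` and the choice to substitute `?` (hex `3f`) for characters a charset cannot represent.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: characters that a charset cannot represent
    are replaced with '?' (byte 0x3f) instead of raising an error.
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("曄"):
        print(label, bits, hex_string)
```

For the single character 曄, the UTF-8 row produced by this sketch is `e69b84`, which matches the first three bytes of the UTF-8 row in the table. Other rows may differ from the page's output wherever the codec mappings assumed here differ from the ones the page uses.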