To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 蒻の??艶k? 1110010011101000100000101100110000111111001111111000100110010000100000101000101100111111 e4e882cc3f3f8990828b3f
EUC-JP 蒻の??艶k? 1110100011101010101001001100111000111111001111111011000111110000101000111110101100111111 e8eaa4ce3f3fb1f0a3eb3f
UTF-8 蒻の쇰폀艶k랠 111010001001001010111011111000111000000110101110111011001000011110110000111011011000111110000000111010001000100110110110111011111011110110001011111010111001111010100000 e892bbe381aeec87b0ed8f80e889b6efbd8beb9ea0
UHC 蒻の쇰폀艶k랠 1110010110110110101010101100111010111100111010111011110010001111111001101111110110100011111010111011011110100100 e5b6aacebcebbc8fe6fda3ebb7a4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)