To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蜈ゆ?鴦?ⅹ歟??^ 111001011000010110000010111001000011111111101001111100010011111111111010010010011001111101100010001111110011111101011110 e58582e43fe9f13ffa499f623f3f5e
EUC-JP 蜈ゆˇ鴦??歟??^ 11101001111001011010010011100110100011111010001010110000111100101111001100111111001111111101110111000011001111110011111101011110 e9e5a4e68fa2b0f2f33f3fddc33f3f5e
UTF-8 蜈ゆˇ鴦⑵ⅹ歟㎩렔^ 111010001001110010001000111000111000001010000110110010111000011111101001101101001010011011100010100100011011010111100010100001011011100111100110101011011001111111100011100011101010100111101011101000001001010001011110 e89c88e38286cb87e9b4a6e291b5e285b9e6ad9fe38ea9eba0945e
UHC 蜈ゆˇ鴦⑵ⅹ歟㎩렔^ 11101000101001011010101011100110101000101010011111100100111011001010100111101000101001011010101011100110101000101010011111100101100011101010100101011110 e8a5aae6a2a7e4eca9e8a5aae6a2a7e58ea95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)