What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???悠?㎝濡??鴨 0011111100111111001111111001011101001001001111111000011101110000100101000100011100111111001111111000101010011011 3f3f3f97493f877094473f3f8a9b
EUC-JP ???悠??濡??鴨 00111111001111110011111111001101101010100011111100111111110001111010100000111111001111111011001111111011 3f3f3fcdaa3f3fc7a83f3fb3fb
UTF-8 捻뀁쉶悠귨㎝濡⑸젧鴨 111011111010011010100100111010111000000010000001111011001000100110110110111001101000001010100000111010101011011110101000111000111000111010011101111001101011111110100001111000101001000110111000111011001010000010100111111010011011010010101000 efa6a4eb8081ec89b6e682a0eab7a8e38e9de6bfa1e291b8eca0a7e9b4a8
UHC 捻뀁쉶悠귨㎝濡⑸젧鴨 1110011011110111101100101110110010011010100011001110101011101101100000101110111110100111101011111110101110100001101010011110101110100000100111111110010011100101 e6f7b2ec9a8ceaed82efa7afeba1a9eba09fe4e5

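SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively. A character that a charset cannot represent appears as "?" (hexadecimal 3f) in the table.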
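
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the code behind this page. The codec names are Python's own, and the mapping from the page's labels (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, as is the use of errors="replace" to reproduce the "?" (3f) substitution. The helper name encode_row is hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "悠"  # one of the characters that appears in the table
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal one.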