Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 櫻???よ?惟??堊 1001111101001110001111110011111100111111100000101110011000111111100010001101001000111111001111111001101010111111 9f4e3f3f3f82e63f88d23f3f9abf
EUC-JP 櫻???よ?惟??堊 1101110110101111001111110011111100111111101001001110100000111111101100001101010000111111001111111101010011000001 ddaf3f3f3fa4e83fb0d43f3fd4c1
UTF-8 櫻뗰퐛藺よ뵯惟곗㉠堊 111001101010101110111011111010111001011110110000111011011001000010011011111011111010011110110000111000111000001010001000111010111011010110101111111001101000001110011111111010101011001110010111111000111000100110100000111001011010000010001010 e6abbbeb97b0ed909befa7b0e38288ebb5afe6839feab397e389a0e5a08a
UHC 櫻뗰퐛藺よ뵯惟곗㉠堊 1110010110100001100010111110111110111101100001011110110011100001101010101110100010010100101011011110101011101110101100001110110010101000101100011110010010111110 e5a18befbd85ece1aae894adeaeeb0eca8b1e4be

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
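
The conversion shown in the table can be approximated with a short Python sketch. It is not the page's own implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the names CHARSETS and encode_row. Characters a charset cannot represent are replaced with "?" (0x3f), which is the behaviour the ISO-8859-1 row suggests.

```python
# Table labels mapped to Python codec names.
# This mapping is an assumption, not taken from the page itself.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising UnicodeEncodeError.
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = data.decode(codec, errors="replace")
    # Each byte is written as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexed = encode_row("よ", codec)
        print(label, shown, bits, hexed)
```

For the single character よ, this sketch yields e38288 in UTF-8, 82e6 in SJIS-WIN, and a4e8 in EUC-JP, which agree with the corresponding bytes in the table rows.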