To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????H??????????^ 00111111001111110011111100111111010010000011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f483f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN シナシァHシト痔芝シッシト漆室^ 1011110011000101101111001010011101001000101111001100010010001110101001001000111011000101101111001010111110111100110001001000111010111101100011101011101001011110 bcc5bca748bcc48ea48ec5bcafbcc48ebd8eba5e
EUC-JP シナシァHシト痔芝シッシト漆室^ 100011101011110010001110110001011000111010111100100011101010011101001000100011101011110010001110110001001011110010100110101111001100011110001110101111001000111010101111100011101011110010001110110001001011110010111111101111001011110001011110 8ebc8ec58ebc8ea7488ebc8ec4bca6bcc78ebc8eaf8ebc8ec4bcbfbcbc5e
UTF-8 シナシァHシト痔芝シッシト漆室^ 1110111110111101101111001110111110111110100001011110111110111101101111001110111110111101101001110100100011101111101111011011110011101111101111101000010011100111100101111001010011101000100010101001110111101111101111011011110011101111101111011010111111101111101111011011110011101111101111101000010011100110101111001000011011100101101011101010010001011110 efbdbcefbe85efbdbcefbda748efbdbcefbe84e79794e88a9defbdbcefbdafefbdbcefbe84e6bc86e5aea45e
UHC ????H??痔芝????漆室^ 0011111100111111001111110011111101001000001111110011111111110110110000001111001010111001001111110011111100111111001111111111011011010100111000111111100001011110 3f3f3f3f483f3ff6c0f2b93f3f3f3ff6d4e3f85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)