To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????q????????O 00111111001111110011111100111111001111110011111101110001001111110011111100111111001111110011111100111111001111110011111101001111 3f3f3f3f3f3f713f3f3f3f3f3f3f3f4f
SJIS-WIN シト篠芝シタqシト偲痔シト偲璽O 10111100110001001000111011000010100011101100010110111100110000000111000110111100110001001000111011000011100011101010010010111100110001001000111011000011100011101010001101001111 bcc48ec28ec5bcc071bcc48ec38ea4bcc48ec38ea34f
EUC-JP シト篠芝シタqシト偲痔シト偲璽O 100011101011110010001110110001001011110011000100101111001100011110001110101111001000111011000000011100011000111010111100100011101100010010111100110001011011110010100110100011101011110010001110110001001011110011000101101111001010010101001111 8ebc8ec4bcc4bcc78ebc8ec0718ebc8ec4bcc5bca68ebc8ec4bcc5bca54f
UTF-8 シト篠芝シタqシト偲痔シト偲璽O 1110111110111101101111001110111110111110100001001110011110101111101000001110100010001010100111011110111110111101101111001110111110111110100000000111000111101111101111011011110011101111101111101000010011100101100000011011001011100111100101111001010011101111101111011011110011101111101111101000010011100101100000011011001011100111100100101011110101001111 efbdbcefbe84e7afa0e88a9defbdbcefbe8071efbdbcefbe84e581b2e79794efbdbcefbe84e581b2e792bd4f
UHC ??篠芝??q???痔???璽O 0011111100111111111000011100011011110010101110010011111100111111011100010011111100111111001111111111011011000000001111110011111100111111110111111101111001001111 3f3fe1c6f2b93f3f713f3f3ff6c03f3f3fdfde4f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)