To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????K 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f4b
SJIS-WIN シナシ・シト痔柴借芝痔柴シウK 101111001100010110111100101001011011110011000100100011101010010010001110110001001000111011011000100011101100010110001110101001001000111011000100101111001011001101001011 bcc5bca5bcc48ea48ec48ed88ec58ea48ec4bcb34b
EUC-JP シナシ・シト痔柴借芝痔柴シウK 1000111010111100100011101100010110001110101111001000111010100101100011101011110010001110110001001011110010100110101111001100011010111100110110101011110011000111101111001010011010111100110001101000111010111100100011101011001101001011 8ebc8ec58ebc8ea58ebc8ec4bca6bcc6bcdabcc7bca6bcc68ebc8eb34b
UTF-8 シナシ・シト痔柴借芝痔柴シウK 11101111101111011011110011101111101111101000010111101111101111011011110011101111101111011010010111101111101111011011110011101111101111101000010011100111100101111001010011100110100111111011010011100101100000001001111111101000100010101001110111100111100101111001010011100110100111111011010011101111101111011011110011101111101111011011001101001011 efbdbcefbe85efbdbcefbda5efbdbcefbe84e79794e69fb4e5809fe88a9de79794e69fb4efbdbcefbdb34b
UHC ??????痔柴借芝痔柴??K 001111110011111100111111001111110011111100111111111101101100000011100011110000111111001110101000111100101011100111110110110000001110001111000011001111110011111101001011 3f3f3f3f3f3ff6c0e3c3f3a8f2b9f6c0e3c33f3f4b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)