What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????????????????????TB | 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 | 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
SJIS-WIN | ｼﾅｼｫｼﾄ捨柴ｼｽｼﾅｼｮｼﾄｼｱｼﾄ執舎TB | 10111100110001011011110010101011101111001100010010001110110011001000111011000100101111001011110110111100110001011011110010101110101111001100010010111100101100011011110011000100100011101011011110001110110010010101010001000010 | bcc5bcabbcc48ecc8ec4bcbdbcc5bcaebcc4bcb1bcc48eb78ec95442
EUC-JP | ｼﾅｼｫｼﾄ捨柴ｼｽｼﾅｼｮｼﾄｼｱｼﾄ執舎TB | 10001110101111001000111011000101100011101011110010001110101010111000111010111100100011101100010010111100110011101011110011000110100011101011110010001110101111011000111010111100100011101100010110001110101111001000111010101110100011101011110010001110110001001000111010111100100011101011000110001110101111001000111011000100101111001011100110111100110010110101010001000010 | 8ebc8ec58ebc8eab8ebc8ec4bccebcc68ebc8ebd8ebc8ec58ebc8eae8ebc8ec48ebc8eb18ebc8ec4bcb9bccb5442
UTF-8 | ｼﾅｼｫｼﾄ捨柴ｼｽｼﾅｼｮｼﾄｼｱｼﾄ執舎TB | 1110111110111101101111001110111110111110100001011110111110111101101111001110111110111101101010111110111110111101101111001110111110111110100001001110011010001101101010001110011010011111101101001110111110111101101111001110111110111101101111011110111110111101101111001110111110111110100001011110111110111101101111001110111110111101101011101110111110111101101111001110111110111110100001001110111110111101101111001110111110111101101100011110111110111101101111001110111110111110100001001110010110011111101101111110100010001000100011100101010001000010 | efbdbcefbe85efbdbcefbdabefbdbcefbe84e68da8e69fb4efbdbcefbdbdefbdbcefbe85efbdbcefbdaeefbdbcefbe84efbdbcefbdb1efbdbcefbe84e59fb7e8888e5442
UHC | ??????捨柴????????????執?TB | 001111110011111100111111001111110011111100111111110111101101011111100011110000110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111111001011111011001111110101010001000010 | 3f3f3f3f3f3fded7e3c33f3f3f3f3f3f3f3f3f3f3f3ff2fb3f5442
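
The rows above can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The codec names are assumptions about which Python codecs correspond to the charsets in the table (cp932 for SJIS-WIN, cp949 for UHC), and a few characters may map differently than in the tool. The helper names encode_row and encode_table are hypothetical. Characters a charset cannot represent are replaced with "?" (3f), as in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("捨TB").items():
        print(name, bits, hex_string, sep=" | ")
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.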

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).