Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN テーテ「ツ「ティツクテ」テーテ「ツ「ティツクテ」B 110000111011000011000011101000101100001010100010110000111010100011000010101110001100001110000001011101101100001110110000110000111010001011000010101000101100001110101000110000101011100011000011100000010111011001000010 c3b0c3a2c2a2c3a8c2b8c38176c3b0c3a2c2a2c3a8c2b8c3817642
EUC-JP テーテ「ツ「ティツクテ」テーテ「ツ「ティツクテ」B 10001110110000111000111010110000100011101100001110001110101000101000111011000010100011101010001010001110110000111000111010101000100011101100001010001110101110001000111011000011101000011101011110001110110000111000111010110000100011101100001110001110101000101000111011000010100011101010001010001110110000111000111010101000100011101100001010001110101110001000111011000011101000011101011101000010 8ec38eb08ec38ea28ec28ea28ec38ea88ec28eb88ec3a1d78ec38eb08ec38ea28ec28ea28ec38ea88ec28eb88ec3a1d742
UTF-8 テーテ「ツ「ティツクテ」テーテ「ツ「ティツクテ」B 11101111101111101000001111101111101111011011000011101111101111101000001111101111101111011010001011101111101111101000001011101111101111011010001011101111101111101000001111101111101111011010100011101111101111101000001011101111101111011011100011101111101111101000001111100011100000001000110111101111101111101000001111101111101111011011000011101111101111101000001111101111101111011010001011101111101111101000001011101111101111011010001011101111101111101000001111101111101111011010100011101111101111101000001011101111101111011011100011101111101111101000001111100011100000001000110101000010 efbe83efbdb0efbe83efbda2efbe82efbda2efbe83efbda8efbe82efbdb8efbe83e3808defbe83efbdb0efbe83efbda2efbe82efbda2efbe83efbda8efbe82efbdb8efbe83e3808d42
UHC ???????????」???????????」B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110100001101110010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111101000011011100101000010 3f3f3f3f3f3f3f3f3f3f3fa1b93f3f3f3f3f3f3f3f3f3f3fa1b942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
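
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_all` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the table does for ISO-8859-1 and UHC.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The tool's own converter may differ in edge cases.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes '?' for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(format(byte, "08b") for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    # U+300D (right corner bracket) followed by "B", both of which occur in the table.
    for label, (bits, hexa) in encode_all("\u300dB").items():
        print(label, bits, hexa)
```

Under these assumptions, the trailing "B" encodes to 01000010 (42) in every charset, and U+300D encodes to 8176 in SJIS-WIN, a1d7 in EUC-JP, and e3808d in UTF-8, which agrees with the byte sequences at the end of the corresponding table rows.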