To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セシ杓湿悉セシ疾湿実セシ疾シムシ、セマ蒔 10111110101111001000111011011011100011101011110010001110101110111011111010111100100011101011111010001110101111001000111011000000101111101011110010001110101111101011110011010001101111001010010010111110110011111000111010101010 bebc8edb8ebc8ebbbebc8ebe8ebc8ec0bebc8ebebcd1bca4becf8eaa
EUC-JP セシ杓湿悉セシ疾湿実セシ疾シムシ、セマ蒔 10001110101111101000111010111100101111001101110110111100101111101011110010111101100011101011111010001110101111001011110011000000101111001011111010111100110000101000111010111110100011101011110010111100110000001000111010111100100011101101000110001110101111001000111010100100100011101011111010001110110011111011110010101100 8ebe8ebcbcddbcbebcbd8ebe8ebcbcc0bcbebcc28ebe8ebcbcc08ebc8ed18ebc8ea48ebe8ecfbcac
UTF-8 セシ杓湿悉セシ疾湿実セシ疾シムシ、セマ蒔 111011111011110110111110111011111011110110111100111001101001110110010011111001101011100110111111111001101000001010001001111011111011110110111110111011111011110110111100111001111001011010111110111001101011100110111111111001011010111010011111111011111011110110111110111011111011110110111100111001111001011010111110111011111011110110111100111011111011111010010001111011111011110110111100111011111011110110100100111011111011110110111110111011111011111010001111111010001001001010010100 efbdbeefbdbce69d93e6b9bfe68289efbdbeefbdbce796bee6b9bfe5ae9fefbdbeefbdbce796beefbdbcefbe91efbdbcefbda4efbdbeefbe8fe89294
UHC ??杓?悉??疾????疾??????蒔 00111111001111111111100011110101001111111110001111111010001111110011111111110010111100000011111100111111001111110011111111110010111100000011111100111111001111110011111100111111001111111110001111001000 3f3ff8f53fe3fa3f3ff2f03f3f3f3ff2f03f3f3f3f3f3fe3c8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)