To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN シタシホ湿質シセシミ疾濵質シセシアシタシホ湿質シセシミ疾濵質シセシアB 101111001100000010111100110011101000111010111100100011101011111110111100101111101011110011010000100011101011111011111011010011011000111010111111101111001011111010111100101100011011110011000000101111001100111010001110101111001000111010111111101111001011111010111100110100001000111010111110111110110100110110001110101111111011110010111110101111001011000101000010 bcc0bcce8ebc8ebfbcbebcd08ebefb4d8ebfbcbebcb1bcc0bcce8ebc8ebfbcbebcd08ebefb4d8ebfbcbebcb142
EUC-JP シタシホ湿質シセシミ疾濵質シセシアシタシホ湿質シセシミ疾濵質シセシアB 1000111010111100100011101100000010001110101111001000111011001110101111001011111010111100110000011000111010111100100011101011111010001110101111001000111011010000101111001100000010001111110010011010011010111100110000011000111010111100100011101011111010001110101111001000111010110001100011101011110010001110110000001000111010111100100011101100111010111100101111101011110011000001100011101011110010001110101111101000111010111100100011101101000010111100110000001000111111001001101001101011110011000001100011101011110010001110101111101000111010111100100011101011000101000010 8ebc8ec08ebc8ecebcbebcc18ebc8ebe8ebc8ed0bcc08fc9a6bcc18ebc8ebe8ebc8eb18ebc8ec08ebc8ecebcbebcc18ebc8ebe8ebc8ed0bcc08fc9a6bcc18ebc8ebe8ebc8eb142
UTF-8 シタシホ湿質シセシミ疾濵質シセシアシタシホ湿質シセシミ疾濵質シセシアB 11101111101111011011110011101111101111101000000011101111101111011011110011101111101111101000111011100110101110011011111111101000101100111010101011101111101111011011110011101111101111011011111011101111101111011011110011101111101111101001000011100111100101101011111011100110101111111011010111101000101100111010101011101111101111011011110011101111101111011011111011101111101111011011110011101111101111011011000111101111101111011011110011101111101111101000000011101111101111011011110011101111101111101000111011100110101110011011111111101000101100111010101011101111101111011011110011101111101111011011111011101111101111011011110011101111101111101001000011100111100101101011111011100110101111111011010111101000101100111010101011101111101111011011110011101111101111011011111011101111101111011011110011101111101111011011000101000010 efbdbcefbe80efbdbcefbe8ee6b9bfe8b3aaefbdbcefbdbeefbdbcefbe90e796bee6bfb5e8b3aaefbdbcefbdbeefbdbcefbdb1efbdbcefbe80efbdbcefbe8ee6b9bfe8b3aaefbdbcefbdbeefbdbcefbe90e796bee6bfb5e8b3aaefbdbcefbdbeefbdbcefbdb142
UHC ?????質????疾?質?????????質????疾?質????B 0011111100111111001111110011111100111111111100101111010100111111001111110011111100111111111100101111000000111111111100101111010100111111001111110011111100111111001111110011111100111111001111110011111111110010111101010011111100111111001111110011111111110010111100000011111111110010111101010011111100111111001111110011111101000010 3f3f3f3f3ff2f53f3f3f3ff2f03ff2f53f3f3f3f3f3f3f3f3ff2f53f3f3f3ff2f03ff2f53f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)