To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????\ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN シ」シ燻セ辞シホ疾汐シヤ疾汐シユ\ 1011110010100011101111001110000010001110101111101000111010101011101111001100111010001110101111101000111010101100101111001101010010001110101111101000111010101100101111001101010101011100 bca3bce08ebe8eabbcce8ebe8eacbcd48ebe8eacbcd55c
EUC-JP シ」シ燻セ辞シホ疾汐シヤ疾汐シユ\ 100011101011110010001110101000111000111010111100110111111110111010001110101111101011110010101101100011101011110010001110110011101011110011000000101111001010111010001110101111001000111011010100101111001100000010111100101011101000111010111100100011101101010101011100 8ebc8ea38ebcdfee8ebebcad8ebc8ecebcc0bcae8ebc8ed4bcc0bcae8ebc8ed55c
UTF-8 シ」シ燻セ辞シホ疾汐シヤ疾汐シユ\ 11101111101111011011110011101111101111011010001111101111101111011011110011100111100001111011101111101111101111011011111011101000101111101001111011101111101111011011110011101111101111101000111011100111100101101011111011100110101100011001000011101111101111011011110011101111101111101001010011100111100101101011111011100110101100011001000011101111101111011011110011101111101111101001010101011100 efbdbcefbda3efbdbce787bbefbdbee8be9eefbdbcefbe8ee796bee6b190efbdbcefbe94e796bee6b190efbdbcefbe955c
UHC ???燻????疾汐??疾汐??\ 00111111001111110011111111111101101110000011111100111111001111110011111111110010111100001110000010110001001111110011111111110010111100001110000010110001001111110011111101011100 3f3f3ffdb83f3f3f3ff2f0e0b13f3ff2f0e0b13f3f5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)