What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????G 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f47
SJIS-WIN 疾辞シナ疾痔セ燻ヒシャ踈シュシウ疾汐シ・疾痔シニG 1000111010111110100011101010101110111100110001011000111010111110100011101010010010111110111000001000111011001011101111001010110011100110111100011011110010101101101111001011001110001110101111101000111010101100101111001010010110001110101111101000111010100100101111001100011001000111 8ebe8eabbcc58ebe8ea4bee08ecbbcace6f1bcadbcb38ebe8eacbca58ebe8ea4bcc647
EUC-JP 疾辞シナ疾痔セ燻ヒシャ踈シュシウ疾汐シ・疾痔シニG 10111100110000001011110010101101100011101011110010001110110001011011110011000000101111001010011010001110101111101101111111101110100011101100101110001110101111001000111010101100111011001111001110001110101111001000111010101101100011101011110010001110101100111011110011000000101111001010111010001110101111001000111010100101101111001100000010111100101001101000111010111100100011101100011001000111 bcc0bcad8ebc8ec5bcc0bca68ebedfee8ecb8ebc8eacecf38ebc8ead8ebc8eb3bcc0bcae8ebc8ea5bcc0bca68ebc8ec647
UTF-8 疾辞シナ疾痔セ燻ヒシャ踈シュシウ疾汐シ・疾痔シニG 11100111100101101011111011101000101111101001111011101111101111011011110011101111101111101000010111100111100101101011111011100111100101111001010011101111101111011011111011100111100001111011101111101111101111101000101111101111101111011011110011101111101111011010110011101000101110001000100011101111101111011011110011101111101111011010110111101111101111011011110011101111101111011011001111100111100101101011111011100110101100011001000011101111101111011011110011101111101111011010010111100111100101101011111011100111100101111001010011101111101111011011110011101111101111101000011001000111 e796bee8be9eefbdbcefbe85e796bee79794efbdbee787bbefbe8befbdbcefbdace8b888efbdbcefbdadefbdbcefbdb3e796bee6b190efbdbcefbda5e796bee79794efbdbcefbe8647
UHC 疾???疾痔?燻????????疾汐??疾痔??G 111100101111000000111111001111110011111111110010111100001111011011000000001111111111110110111000001111110011111100111111001111110011111100111111001111110011111111110010111100001110000010110001001111110011111111110010111100001111011011000000001111110011111101000111 f2f03f3f3ff2f0f6c03ffdb83f3f3f3f3f3f3f3ff2f0e0b13f3ff2f0f6c03f3f47

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
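
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs "cp932" and "cp949" correspond to the SJIS-WIN and UHC rows, and that characters a charset cannot represent are replaced by "?" (0x3f), which is what the ISO-8859-1 and UHC rows appear to show. The names CHARSETS and encode_to_bits are hypothetical and used only for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the Windows Shift_JIS variant
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as Unified Hangul Code
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "疾G"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Under these assumptions, "G" encodes to 47 in every charset listed, matching the last byte of each row in the table, while "疾" yields a different byte sequence per charset (for example e796be in UTF-8).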