To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癌先昻善洩先癌先昻善洩厄癌先 10001010111000001001000011100110111110101101000010010001010100001000100101101011100100001110011010001010111000001001000011100110111110101101000010010001010100001000100101101011100101101110111110001010111000001001000011100110 8ae090e6fad09150896b90e68ae090e6fad09150896b96ef8ae090e6
EUC-JP 癌先?善洩先癌先?善洩厄癌先 1011010011100010110000001110100000111111110000011011000110110001110011001100000011101000101101001110001011000000111010000011111111000001101100011011000111001100110011001111000110110100111000101100000011101000 b4e2c0e83fc1b1b1ccc0e8b4e2c0e83fc1b1b1ccccf1b4e2c0e8
UTF-8 癌先昻善洩先癌先昻善洩厄癌先 111001111001100110001100111001011000010110001000111001101001100010111011111001011001011010000100111001101011010010101001111001011000010110001000111001111001100110001100111001011000010110001000111001101001100010111011111001011001011010000100111001101011010010101001111001011000111010000100111001111001100110001100111001011000010110001000 e7998ce58588e698bbe59684e6b4a9e58588e7998ce58588e698bbe59684e6b4a9e58e84e7998ce58588
UHC 癌先昻善洩先癌先昻善洩厄癌先 11100100110111111110000010111011111001001110100111100000101111001110000011011101111000001011101111100100110111111110000010111011111001001110100111100000101111001110000011011101111001001111100011100100110111111110000010111011 e4dfe0bbe4e9e0bce0dde0bbe4dfe0bbe4e9e0bce0dde4f8e4dfe0bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)