What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????U}????????U{^ 001111110011111100111111001111110011111100111111001111110011111101010101011111010011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 邊「霆ク螯コ逍セU}邊「霆ク螯コ逍セU{^ 1110011110110010101000101110100010111011101110001110010110100110101110101110011110010110101111100101010101111101111001111011001010100010111010001011101110111000111001011010011010111010111001111001011010111110010101010111101101011110 e7b2a2e8bbb8e5a6bae796be557de7b2a2e8bbb8e5a6bae796be557b5e
EUC-JP 邊「霆ク螯コ逍セU}邊「霆ク螯コ逍セU{^ 11101110101101001000111010100010111100001011110110001110101110001110101010101000100011101011101011101101111101101000111010111110010101010111110111101110101101001000111010100010111100001011110110001110101110001110101010101000100011101011101011101101111101101000111010111110010101010111101101011110 eeb48ea2f0bd8eb8eaa88ebaedf68ebe557deeb48ea2f0bd8eb8eaa88ebaedf68ebe557b5e
UTF-8 邊「霆ク螯コ逍セU}邊「霆ク螯コ逍セU{^ 1110100110000010100010101110111110111101101000101110100110011100100001101110111110111101101110001110100010011110101011111110111110111101101110101110100110000000100011011110111110111101101111100101010101111101111010011000001010001010111011111011110110100010111010011001110010000110111011111011110110111000111010001001111010101111111011111011110110111010111010011000000010001101111011111011110110111110010101010111101101011110 e9828aefbda2e99c86efbdb8e89eafefbdbae9808defbdbe557de9828aefbda2e99c86efbdb8e89eafefbdbae9808defbdbe557b5e
UHC 邊?霆???逍?U}邊?霆???逍?U{^ 110111001010101100111111111011111111110100111111001111110011111111100001110011100011111101010101011111011101110010101011001111111110111111111101001111110011111100111111111000011100111000111111010101010111101101011110 dcab3feffd3f3f3fe1ce3f557ddcab3feffd3f3f3fe1ce3f557b5e

SJIS-Win, EUC-JP: legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
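
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a rough sketch, not the tool's actual implementation: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper name `encode_row` are assumptions chosen for illustration, and rows for non-ASCII input may differ from the table depending on how the tool interprets its input. Characters a charset cannot represent are replaced with `?` (hex 3f), as in the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


# Print one row per charset for a short ASCII sample.
for name, codec in CHARSETS.items():
    bits, hex_string = encode_row("U}", codec)
    print(name, bits, hex_string)
```

For the ASCII pair `U}`, every charset listed yields the same bytes, `557d`, which matches the corresponding segment in each row of the table.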