To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 庸?????鷹??[庸?????鷹??[^ 10010111011001100011111100111111001111110011111100111111100100011110100100111111001111110101101110010111011001100011111100111111001111110011111100111111100100011110100100111111001111110101101101011110 97663f3f3f3f3f91e93f3f5b97663f3f3f3f3f91e93f3f5b5e
EUC-JP 庸?????鷹??[庸?????鷹??[^ 11001101110001110011111100111111001111110011111100111111110000101110101100111111001111110101101111001101110001110011111100111111001111110011111100111111110000101110101100111111001111110101101101011110 cdc73f3f3f3f3fc2eb3f3f5bcdc73f3f3f3f3fc2eb3f3f5b5e
UTF-8 庸뉗뢾罹숅펶鷹곸틯[庸뉗뢾罹숅펶鷹곸틯[^ 111001011011101010111000111010111000100110010111111010111010001010111110111011111010011110100110111011001000100010000101111011011000111010110110111010011011011110111001111010101011001110111000111011011000101110101111010110111110010110111010101110001110101110001001100101111110101110100010101111101110111110100111101001101110110010001000100001011110110110001110101101101110100110110111101110011110101010110011101110001110110110001011101011110101101101011110 e5bab8eb8997eba2beefa7a6ec8885ed8eb6e9b7b9eab3b8ed8baf5be5bab8eb8997eba2beefa7a6ec8885ed8eb6e9b7b9eab3b8ed8baf5b5e
UHC 庸뉗뢾罹숅펶鷹곸틯[庸뉗뢾罹숅펶鷹곸틯[^ 111010011011110010000111111011001000111110000001111011001011101010011001111010011011110010000111111010111110110110000001111011001011101010011001010110111110100110111100100001111110110010001111100000011110110010111010100110011110100110111100100001111110101111101101100000011110110010111010100110010101101101011110 e9bc87ec8f81ecba99e9bc87ebed81ecba995be9bc87ec8f81ecba99e9bc87ebed81ecba995b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)