To which bit string is a character (or characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭?㎡???鴉??音??域??愉ゆ? 00111111001111110011111110010000011110000011111110000111011101010011111100111111001111111110100111101011001111110011111110001001101110010011111100111111100010001110011000111111001111111001011011111001100000101110010000111111 3f3f3f90783f87753f3f3fe9eb3f3f89b93f3f88e63f3f96f982e43f
EUC-JP ???靭??洹??鴉??音??域??愉ゆ? 0011111100111111001111111011111111011001001111110011111110001111110001111011101000111111001111111111001011101101001111110011111110110010101110110011111100111111101100001110100000111111001111111100110011111011101001001110011000111111 3f3f3fbfd93f3f8fc7ba3f3ff2ed3f3fb2bb3f3fb0e83f3fccfba4e63f
UTF-8 麗몃쓷靭뚳㎡洹잙쨨鴉롦룚音섎룆域밟뫁愉ゆ갭 111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001101010110011111000111000111010100001111001101011010010111001111011001001111010011001111011001010100010101000111010011011010010001001111010111010000110100110111010111010001110011010111010011001111110110011111011001000010010001110111010111010001110000110111001011001111110011111111010111011000010011111111010111010101110000001111001101000010010001001111000111000001010000110111010101011000010101101 efa688ebaa83ec93b7e99dadeb9ab3e38ea1e6b4b9ec9e99eca8a8e9b489eba1a6eba39ae99fb3ec848eeba386e59f9febb09febab81e68489e38286eab0ad
UHC 麗몃쓷靭뚳㎡洹잙쨨鴉롦룚音섎룆域밟뫁愉ゆ갭 111001101011000010111000111010111001110110010100111011001110010110001100111011111010011110110011111010101011011110011111111010111010010010000011111001001011110010001110111001101000111110010110111010111110010110011000111010111000111110000101111001101011010010111001111000101001000110100101111010101111000010101010111001101011000010111000 e6b0b8eb9d94ece58cefa7b3eab79feba483e4bc8ee68f96ebe598eb8f85e6b4b9e291a5eaf0aae6b0b8

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
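
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the charsets listed above (for example `cp932` for SJIS-WIN and `cp949` for UHC), and the function names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary string is always four times as long as the hexadecimal one. Results for rare characters may differ slightly from this page if its charset tables are not identical to Python's codecs.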