Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺?????邑り?夜??認η?猷????? 11111011110001000011111100111111001111110011111100111111100101110101011110000010111010000011111110010110111010010011111100111111100101000100011010000011110001010011111110010111010100010011111100111111001111110011111100111111 fbc43f3f3f3f3f975782e83f96e93f3f944683c53f97513f3f3f3f3f
EUC-JP 鈺??佾??邑り?夜??認η?猷????? 10001111111000111101010100111111001111111000111110110000111110110011111100111111110011011011100010100100111010100011111111001100111010110011111100111111110001111010011110100110110001110011111111001101101100100011111100111111001111110011111100111111 8fe3d53f3f8fb0fb3f3fcdb8a4ea3fcceb3f3fc7a7a6c73fcdb23f3f3f3f3f
UTF-8 鈺쎄릿佾잍납邑り갔夜껋뮁認η삜猷⑷쉐略노풚 1110100110001000101110101110110010001110100001001110101110100110101111111110010010111101101111101110110010011110100011011110101110000010101010011110100110000010100100011110001110000010100010101110101010110000100101001110010110100100100111001110101010111011100010111110101110101110100000011110100010101010100011011100111010110111111011001000001010011100111001111000110010110111111000101001000110110111111011001000100110010000111011111010010110110110111010111000010110111000111011011001001010011010 e988baec8e84eba6bfe4bdbeec9e8deb82a9e98291e3828aeab094e5a49ceabb8bebae81e8aa8dceb7ec829ce78cb7e291b7ec8990efa5b6eb85b8ed929a
UHC 鈺쎄릿佾잍납邑り갔夜껋뮁認η삜猷⑷쉐略노풚 111010001010110110111101111010101011100010110100111011001110101110011111111001101011001110110011111010111110100110101010111010101011000010101100111001011010100010000011111011001001001010010000111011001110001110100101111001111001100010011111111010111010001110101001111010101011110110100110111001011011001010110011111010111011111010011101 e8adbdeab8b4eceb9fe6b3b3ebe9aaeab0ace5a883ec9290ece3a5e7989feba3a9eabda6e5b2b3ebbe9d

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
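
The same kind of conversion can be sketched in a few lines of Python. This is only an illustration, not the code behind this page. The codec names are Python's own, and mapping UHC to `cp949` is an assumption, as is replacing unencodable characters with "?" (byte 3f), which is what the 3f bytes in the table above suggest. The function name `encode_all` is hypothetical.

```python
# Map the labels used in the table to Python codec names.
# SJIS-WIN -> cp932 follows the note above; UHC -> cp949 is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return (charset, binary bit string, hex bit string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot represent, instead of raising UnicodeEncodeError.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_all("あ"):
        print(label, bits, hex_string)
```

For the input "あ", for example, the UTF-8 row is e38182 and the ISO-8859-1 row is 3f, because ISO-8859-1 has no code for that character.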