To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 儀?????儀???????????松ユ? 10001011010101100011111100111111001111110011111100111111100010110101011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000111110111100100000111000011000111111 8b563f3f3f3f3f8b563f3f3f3f3f3f3f3f3f3f3f8fbc83863f
EUC-JP 儀?????儀???????????松ユ? 10110101101101110011111100111111001111110011111100111111101101011011011100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011111010111110101001011110011000111111 b5b73f3f3f3f3fb5b73f3f3f3f3f3f3f3f3f3f3fbebea5e63f
UTF-8 儀붾졂溜딁퇌儀붾졂溜딅죾琉룬썿李딇븠松ユ쉪 111001011000010010000000111010111011011010111110111011001010000110000010111011111010011110001011111010111001010010000001111011011000011110001100111001011000010010000000111010111011011010111110111011001010000110000010111011111010011110001011111010111001010010000101111011001010001110111110111011111010011110001100111010111010001110101100111011001000110110111111111011111010011110100001111010111001010010000111111010111011100010100000111001101001110110111110111000111000001110100110111011001000100110101010 e58480ebb6beeca182efa78beb9481ed878ce58480ebb6beeca182efa78beb9485eca3beefa78ceba3acec8dbfefa7a1eb9487ebb8a0e69dbee383a6ec89aa
UHC 儀붾졂溜딁퇌儀붾졂溜딅죾琉룬썿李딇븠松ユ쉪 111010111111000010010100111010111010000010110011111010101111111010001010111001111011011110011101111010111111000010010100111010111010000010110011111010101111111010001010111010111010000110010110111010111010010010110111111010011001101110101001111011001011000010001010111011011001010110001001111000011110011010101011111001101001101010000100 ebf094eba0b3eafe8ae7b79debf094eba0b3eafe8aeba196eba4b7e99ba9ecb08aed9589e1e6abe69a84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)