To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???唯??儀??燁?????唯??儀??燁??B 00111111001111110011111110010111010000100011111100111111100010110101011000111111001111111111101101011001001111110011111100111111001111110011111110010111010000100011111100111111100010110101011000111111001111111111101101011001001111110011111101000010 3f3f3f97423f3f8b563f3ffb593f3f3f3f3f97423f3f8b563f3ffb593f3f42
EUC-JP ???唯??儀??燁?????唯??儀??燁??B 001111110011111100111111110011011010001100111111001111111011010110110111001111110011111110001111110010101011001100111111001111110011111100111111001111111100110110100011001111110011111110110101101101110011111100111111100011111100101010110011001111110011111101000010 3f3f3fcda33f3fb5b73f3f8fcab33f3f3f3f3fcda33f3fb5b73f3f8fcab33f3f42
UTF-8 嶺뚢돦唯섊뙼儀뤾콞燁삳몖嶺뚢돦唯섊뙼儀뤾콞燁삳몖B 11101111101001101010101111101011100110101010001011101011100011111010011011100101100101001010111111101100100001001000101011101011100110011011110011100101100001001000000011101011101001001011111011101100101111011001111011100111100001111000000111101100100000101011001111101011101010101001011011101111101001101010101111101011100110101010001011101011100011111010011011100101100101001010111111101100100001001000101011101011100110011011110011100101100001001000000011101011101001001011111011101100101111011001111011100111100001111000000111101100100000101011001111101011101010101001011001000010 efa6abeb9aa2eb8fa6e594afec848aeb99bce58480eba4beecbd9ee78781ec82b3ebaa96efa6abeb9aa2eb8fa6e594afec848aeb99bce58480eba4beecbd9ee78781ec82b3ebaa9642
UHC 嶺뚢돦唯섊뙼儀뤾콞燁삳몖嶺뚢돦唯섊뙼儀뤾콞燁삳몖B 11100111101011011000110011100010100010011010101011101010111001101001100011100111100011001011111111101011111100001000111111101010101100011001011011100111101001111011101111101011100100011000010011100111101011011000110011100010100010011010101011101010111001101001100011100111100011001011111111101011111100001000111111101010101100011001011011100111101001111011101111101011100100011000010001000010 e7ad8ce289aaeae698e78cbfebf08feab196e7a7bbeb9184e7ad8ce289aaeae698e78cbfebf08feab196e7a7bbeb918442

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
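
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation, which is not stated here. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents, and `errors="replace"` is assumed to mirror the `?` (0x3f) substitution the table shows for characters a charset cannot represent. The names `CHARSETS`, `encode_to_bits`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters are replaced with "?" (assumption, see above).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and collect the bit strings."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("B").items():
        print(label, binary, hexadecimal)
```

For the input "B", every charset listed yields the single byte 42 (binary 01000010), which matches the final byte of each row in the table.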