To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 塢??要??億?????節?????汚??^ 100110101100011100111111001111111001011101110110001111110011111110001001101011010011111100111111001111110011111100111111100100001101111100111111001111110011111100111111001111111000100110011000001111110011111101011110 9ac73f3f97763f3f89ad3f3f3f3f3f90df3f3f3f3f3f89983f3f5e
EUC-JP 塢??要??億?????節?????汚??^ 110101001100100100111111001111111100110111010111001111110011111110110010101011110011111100111111001111110011111100111111110000001110000100111111001111110011111100111111001111111011000111111000001111110011111101011110 d4c93f3fcdd73f3fb2af3f3f3f3f3fc0e13f3f3f3f3fb1f83f3f5e
UTF-8 塢뽳쉘要뺞캈億계웺寧좄뱰節⑶찕驪붹꽣汚녽죳^ 11100101101000011010001011101011101111011011001111101100100010011001100011101000101001101000000111101011101110101001111011101100101110101000100011100101100001001000010011101010101100111000010011101100100110111011101011101111101001101010101011101100101000101000010011101011101100011011000011100111101011111000000011100010100100011011011011101100101100001001010111101111101001101000011111101011101101101011100111101010101111011010001111100110101100011001101011101011100001011011110111101100101000111011001101011110 e5a1a2ebbdb3ec8998e8a681ebba9eecba88e58484eab384ec9bbaefa6aaeca284ebb1b0e7af80e291b6ecb095efa687ebb6b9eabda3e6b19aeb85bdeca3b35e
UHC 塢뽳쉘要뺞캈億계웺寧좄뱰節⑶찕驪붹꽣汚녽죳^ 11100111111100011001011011101111101111011010100111101001101010011001010111100110101011111001010011100101111000101011000011101000100111111000011011100111101011001010000011101000100100111001011011101111101111011010100111101001101010011001010111100110101011111001010011100110100001001011000011100111111111011000011011101001101000011000111001011110 e7f196efbda9e9a995e6af94e5e2b0e89f86e7aca0e89396efbda9e9a995e6af94e684b0e7fd86e9a18e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)