To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 渦??娃??受?????節??円??澳??^ 10001001010100010011111100111111100010001010000100111111001111111000111011110011001111110011111100111111001111110011111110010000110111110011111100111111100010010111111000111111001111111110000001010011001111110011111101011110 89513f3f88a13f3f8ef33f3f3f3f3f90df3f3f897e3f3fe0533f3f5e
EUC-JP 渦??娃??受?????節??円??澳??^ 10110001101100100011111100111111101100001010001100111111001111111011110011110101001111110011111100111111001111110011111111000000111000010011111100111111101100011101111100111111001111111101111110110100001111110011111101011110 b1b23f3fb0a33f3fbcf53f3f3f3f3fc0e13f3fb1df3f3fdfb43f3f5e
UTF-8 渦뤄슉娃꿩솦受썼젃寧좄쓿節들겛円잂뼺澳뉔죳^ 11100110101110001010011011101011101001001000010011101100100010101000100111100101101010001000001111101010101111111010100111101100100001101010011011100101100011111001011111101100100011011011110011101100101000001000001111101111101001101010101011101100101000101000010011101100100100111011111111100111101011111000000011101011100100111010010011101010101100101001101111100101100001101000011011101100100111101000001011101011101111001011101011100110101111101011001111101011100010011001010011101100101000111011001101011110 e6b8a6eba484ec8a89e5a883eabfa9ec86a6e58f97ec8dbceca083efa6aaeca284ec93bfe7af80eb93a4eab29be58686ec9e82ebbcbae6beb3eb8994eca3b35e
UHC 渦뤄슉娃꿩솦受썼젃寧좄쓿節들겛円잂뼺澳뉔죳^ 11101000101111101011011111101111101111011011010111101000110111111011001011100110100110011001111111100001111101001011110111101000101000001000011111100111101011001010000011101000101111101011011111101111101111011011010111101001100000011011001011100101111101111001111111100010100101101011110111100111111111101000011111101001101000011000111001011110 e8beb7efbdb5e8dfb2e6999fe1f4bde8a087e7aca0e8beb7efbdb5e981b2e5f79fe296bde7fe87e9a18e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)