To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 濡??軟??畑??????????ⅳ猷??^ 100101000100011100111111001111111001001111101110001111110011111110010100101010000011111100111111001111110011111100111111001111110011111100111111001111110011111111111010010000111001011101010001001111110011111101011110 94473f3f93ee3f3f94a83f3f3f3f3f3f3f3f3f3ffa4397513f3f5e
EUC-JP 濡??軟??畑???????????猷??^ 1100011110101000001111110011111111000110111100000011111100111111110010001010101000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111100110110110010001111110011111101011110 c7a83f3fc6f03f3fc8aa3f3f3f3f3f3f3f3f3f3f3fcdb23f3f5e
UTF-8 濡띾젡軟숇ㅎ畑ㅻ짔吏묐죲溜싩뀢琉억ⅳ猷앹쭛^ 11100110101111111010000111101011100111011011111011101100101000001010000111101000101110111001111111101100100010001000011111100011100001011000111011100111100101011001000111100011100001011011101111101100101001111001010011101111101001111001111011101011101011001001000011101100101000111011001011101111101001111000101111101100100010111010100111101011100000001010001011101111101001111000110011101100100101101011010111100010100001011011001111100111100011001011011111101100100101011011100111101100101011011001101101011110 e6bfa1eb9dbeeca0a1e8bb9fec8887e3858ee79591e385bbeca794efa79eebac90eca3b2efa78bec8ba9eb80a2efa78cec96b5e285b3e78cb7ec95b9ecad9b5e
UHC 濡띾젡軟숇ㅎ畑ㅻ짔吏묐죲溜싩뀢琉억ⅳ猷앹쭛^ 11101011101000011000110111101011101000001001101011100110111000111001100111101011101001001011111011101111101001011010010011101011101000111001110111101100101001111001000111101011101000011000110111101010111111101001101011100111100001011001100111101011101001001011111011101111101001011010010011101011101000111001110111101100101001111001000101011110 eba18deba09ae6e399eba4beefa5a4eba39deca791eba18deafe9ae78599eba4beefa5a4eba39deca7915e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
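
The conversion in the table can be reproduced roughly as follows. This is a minimal sketch, not this page's actual implementation: the names `CHARSETS`, `to_bit_strings` and `convert` are made up for illustration, the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents, and replacing unencodable characters with "?" (0x3f) is an assumption based on the 3f bytes seen in the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def to_bit_strings(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent; this mirrors the table but is an assumption.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bit_strings(text, codec)
            for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("畑^").items():
        print(label, binary, hexadecimal)
```

For example, `convert("畑")` gives the hexadecimal string 94a8 for SJIS-Win, c8aa for EUC-JP and e79591 for UTF-8, matching the bytes shown for that character in the table, while ISO-8859-1 cannot represent it and yields 3f.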