Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ????坎???四}????坎???四{^ 00111111001111110011111100111111100110101010101000111111001111110011111110001110011011000111110100111111001111110011111100111111100110101010101000111111001111110011111110001110011011000111101101011110 3f3f3f3f9aaa3f3f3f8e6c7d3f3f3f3f9aaa3f3f3f8e6c7b5e
EUC-JP ????坎???四}????坎???四{^ 00111111001111110011111100111111110101001010110000111111001111110011111110111011110011010111110100111111001111110011111100111111110101001010110000111111001111110011111110111011110011010111101101011110 3f3f3f3fd4ac3f3f3fbbcd7d3f3f3f3fd4ac3f3f3fbbcd7b5e
UTF-8 쒔렋쑈뤰坎쳩쥙죳四}쒔렋쑈뤰坎쳩쥙죳四{^ 111011001001001010010100111010111010000010001011111011001001000110001000111010111010010010110000111001011001110110001110111011001011001110101001111011001010010110011001111011001010001110110011111001011001101110011011011111011110110010010010100101001110101110100000100010111110110010010001100010001110101110100100101100001110010110011101100011101110110010110011101010011110110010100101100110011110110010100011101100111110010110011011100110110111101101011110 ec9294eba08bec9188eba4b0e59d8eecb3a9eca599eca3b3e59b9b7dec9294eba08bec9188eba4b0e59d8eecb3a9eca599eca3b3e59b9b7b5e
UHC 쒔렋쑈뤰坎쳩쥙죳四}쒔렋쑈뤰坎쳩쥙죳四{^ 101111101010110110001110101000101011111010100100100011111101111011001010111011001010101110001110101000101000111010100001100011101101111011001100011111011011111010101101100011101010001010111110101001001000111111011110110010101110110010101011100011101010001010001110101000011000111011011110110011000111101101011110 bead8ea2bea48fdecaecab8ea28ea18edecc7dbead8ea2bea48fdecaecab8ea28ea18edecc7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
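
A similar table can be produced offline. The sketch below is a rough Python approximation, not this page's actual implementation. The function name `encode_table` is hypothetical, and the codec names are assumptions about how the labels map to Python: `cp932` for SJIS-WIN and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches what the table above shows.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: SJIS-Win = CP932
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: UHC = Windows code page 949
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets:
        # Unencodable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        shown = data.decode(codec)                    # what survives the round trip
        bits = "".join(f"{b:08b}" for b in data)      # 8 bits per byte
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("四{^"):
        print(label, shown, bits, hexstr)
```

For the input `四{^`, the UTF-8 row ends in `e59b9b7b5e` and the ISO-8859-1 row starts with `3f`, in line with the tail of the rows shown above.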