To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 躍??揖??惟??艾??違?ゥ怨??畑 100101101111010000111111001111111001011101001011001111110011111110001000110100100011111100111111111001001000100000111111001111111000100011100001001111111000001101000100100010011000010100111111001111111001010010101000 96f43f3f974b3f3f88d23f3fe4883f3f88e13f834489853f3f94a8
EUC-JP 躍??揖??惟??艾??違?ゥ怨??畑 110011001111011000111111001111111100110110101100001111110011111110110000110101000011111100111111111001111110100000111111001111111011000011100011001111111010010110100101101100011110010100111111001111111100100010101010 ccf63f3fcdac3f3fb0d43f3fe7e83f3fb0e33fa5a5b1e53f3fc8aa
UTF-8 躍노씛揖욕컜惟듭뒳艾싲톩違쇤ゥ怨븍눜畑 111010001011101010001101111010111000010110111000111011001001010010011011111001101000111110010110111011001001101010010101111011001011101110011100111001101000001110011111111010111001001110101101111010111001001010110011111010001000100110111110111011001000101110110010111011011000011010101001111010011000000110010101111011001000011110100100111000111000001010100101111001101000000010101000111010111011100010001101111010111000100010011100111001111001010110010001 e8ba8deb85b8ec949be68f96ec9a95ecbb9ce6839feb93adeb92b3e889beec8bb2ed86a9e98195ec87a4e382a5e680a8ebb88deb889ce79591
UHC 躍노씛揖욕컜惟듭뒳艾싲톩違쇤ゥ怨븍눜畑 1110010110111000101100111110101110011101101100001110101111100111101111111110010110110000100001111110101011101110101101011110110010001010101011001110010011110101100110101110101110110111100000011110101011011110101111001110100110101011101001011110101010110011101110101110101110000111101101001110111110100101 e5b8b3eb9db0ebe7bfe5b087eaeeb5ec8aace4f59aebb781eadebce9aba5eab3baeb87b4efa5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
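
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The function name `encode_report` is hypothetical, and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. It also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), as the ISO-8859-1 row suggests.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, charsets=CHARSETS):
    """Return one (label, decoded text, binary string, hex string) row per charset."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what actually survived the round trip.
        shown = data.decode(codec, errors="replace")
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexa in encode_report("畑"):
        print(label, shown, bits, hexa)
```

For the single character 畑, the UTF-8 row should give the hex string e79591, which matches the last three bytes of the UTF-8 row in the table above.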