To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???癲??疫?え蟻???れ?疫?え蟻 001111110011111100111111111000011001111100111111001111111000100101110101001111111000001010100110100010110110000100111111001111110011111110000010111010100011111110001001011101010011111110000010101001101000101101100001 3f3f3fe19f3f3f89753f82a68b613f3f3f82ea3f89753f82a68b61
EUC-JP ???癲??疫?え蟻???れ?疫?え蟻 001111110011111100111111111000101010000100111111001111111011000111010110001111111010010010101000101101011100001000111111001111110011111110100100111011000011111110110001110101100011111110100100101010001011010111000010 3f3f3fe2a13f3fb1d63fa4a8b5c23f3f3fa4ec3fb1d63fa4a8b5c2
UTF-8 琉쀥쬂癲욎꽱疫욌え蟻붹툨劣れ뵪疫욌え蟻 111011111010011110001100111011001000000010100101111011001010110010000010111001111001100110110010111011001001101010001110111010101011110110110001111001111001011010101011111011001001101010001100111000111000000110001000111010001001111110111011111010111011011010111001111011011000100010101000111011111010011010011101111000111000001010001100111010111011010110101010111001111001011010101011111011001001101010001100111000111000000110001000111010001001111110111011 efa78cec80a5ecac82e799b2ec9a8eeabdb1e796abec9a8ce38188e89fbbebb6b9ed88a8efa69de3828cebb5aae796abec9a8ce38188e89fbb
UHC 琉쀥쬂癲욎꽱疫욌え蟻붹툨劣れ뵪疫욌え蟻 1110101110100100100101111110010110100110100110011110111110100110100111101110110010000100101111001110011010111001100111101110101110101010101010001110101111111100100101001110011010111000100111111110011011101011101010101110110010010100101010001110011010111001100111101110101110101010101010001110101111111100 eba497e5a699efa69eec84bce6b99eebaaa8ebfc94e6b89fe6ebaaec94a8e6b99eebaaa8ebfc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)