What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????違??永??寃??乙j?永??鎰? 001111110011111100111111001111110011111100111111100010001110000100111111001111111000100101101001001111110011111110011011100000110011111100111111100010011011001110000010100010100011111110001001011010010011111100111111111010000100110000111111 3f3f3f3f3f3f88e13f3f89693f3f9b833f3f89b3828a3f89693f3fe84c3f
EUC-JP ???瑗??違??永??寃??乙j?永??鎰? 0011111100111111001111111000111111001100110000000011111100111111101100001110001100111111001111111011000111001010001111110011111111010101111000110011111100111111101100101011010110100011111010100011111110110001110010100011111100111111111011111010110100111111 3f3f3f8fccc03f3fb0e33f3fb1ca3f3fd5e33f3fb2b5a3ea3fb1ca3f3fefad3f
UTF-8 捻뚭여瑗뉒솈違먰뮊永띕벩寃쎽벧乙j쿆永띠룊鎰갃 111011111010011010100100111010111001101010101101111011001001011110101100111001111001000110010111111010111000100110010010111011001000011010001000111010011000000110010101111010111010100010110000111010111010111010001010111001101011000010111000111010111001110110010101111010111011001010101001111001011010111110000011111011001000111010111101111010111011001010100111111001001011100110011001111011111011110110001010111011001011111110000110111001101011000010111000111010111001110110100000111010111010001110001010111010011000111010110000111010101011000010000011 efa6a4eb9aadec97ace79197eb8992ec8688e98195eba8b0ebae8ae6b0b8eb9d95ebb2a9e5af83ec8ebdebb2a7e4b999efbd8aecbf86e6b0b8eb9da0eba38ae98eb0eab083
UHC 捻뚭여瑗뉒솈違먰뮊永띕벩寃쎽벧乙j쿆永띠룊鎰갃 11100110111101111000110011101010101111111010100111101010101111001000011111100111100110011000110011101010110111101001000011101101100100101001100011100111101101011011011011101011100100111011111111101010101100101001101111100100101110101010011011101011111000001010001111101010101100101001101111100111101101011011011011101100100011111000100111101100111100001000000101000010 e6f78ceabfa9eabc87e7998ceade90ed9298e7b5b6eb93bfeab29be4baa6ebe0a3eab29be7b5b6ec8f89ecf08142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
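
The conversion shown in the table can be approximated offline with a short script. The following is a minimal sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from the page's converter for a few vendor-specific characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one table row per charset for a single sample character.
    for label, (bits, hexstr) in convert("永").items():
        print(label, bits, hexstr)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.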