Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 殿???蹄??壟??蹙?邵營昱????? 100100110110000100111111001111110011111110010010111110110011111100111111100110101110000000111111001111111110011101000101001111111110011110111000100110100111101011111010011000110011111100111111001111110011111100111111 93613f3f3f92fb3f3f9ae03f3fe7453fe7b89a7afa633f3f3f3f3f
EUC-JP 殿???蹄??壟??蹙?邵營昱???孼? 110001011100001000111111001111110011111111000100111111010011111100111111110101001110001000111111001111111110110110100110001111111110111010111010110100111101101110001111110000101010110100111111001111110011111110001111101110101100001100111111 c5c23f3f3fc4fd3f3fd4e23f3feda63feebad3db8fc2ad3f3f3f8fbac33f
UTF-8 殿댓렰렢蹄꿰렪壟뤉뤠蹙얠邵營昱눠咽멀孼뭍 111001101010111010111111111010111000110010010011111010111010000010110000111010111010000010100010111010001011100110000100111010101011111110110000111010111010000010101010111001011010001110011111111010111010010010001001111010111010010010100000111010001011100110011001111011001001011010100000111010011000001010110101111001111000011110011111111001101001100010110001111010111000100010100000111011111010011010011110111010111010100110000000111001011010110110111100111010111010110110001101 e6aebfeb8c93eba0b0eba0a2e8b984eabfb0eba0aae5a39feba489eba4a0e8b999ec96a0e982b5e7879fe698b1eb88a0efa69eeba980e5adbcebad8d
UHC 殿댓렰렢蹄꿰렪壟뤉뤠蹙얠邵營昱눠咽멀孼뭍 11101110111111001011010011110001100011101011110110001110101100111111000010110100101100101110011110001110101110001101011011100110100011111011100110110111111100011111010111101100101111101110110011100001110100001110011110111101111010011111000010110100101100101110011011101100101110001101011011100101111011011011100110110111 eefcb4f18ebd8eb3f0b4b2e78eb8d6e68fb9b7f1f5ecbeece1d0e7bde9f0b4b2e6ecb8d6e5edb9b7

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
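
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the code behind this page: the function name to_bit_strings and the CODECS mapping are hypothetical, and pairing the labels above with Python's codec names (SJIS-WIN with cp932, UHC with cp949) is an assumption. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table, although the exact substitution may differ from this tool's for some inputs.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = to_bit_strings("殿", codec)
        print(label, bits, hex_string)
```

For the single character 殿, this yields e6aebf for UTF-8 and 9361 for cp932, the same leading bytes as the UTF-8 and SJIS-WIN rows above.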