To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 譎帛クク邨り抄蜑雁香譎帛クク邨り抄蜑雁香 111001101001100110011011111001011011100010111000111001111011010110000010111010001000111110110100111001011000100110001010111001011000110110000001111001101001100110011011111001011011100010111000111001111011010110000010111010001000111110110100111001011000100110001010111001011000110110000001 e6999be5b8b8e7b582e88fb4e5898ae58d81e6999be5b8b8e7b582e88fb4e5898ae58d81
EUC-JP 譎帛クク邨り抄蜑雁香譎帛クク邨り抄蜑雁香 11101011111110011101011011100111100011101011100010001110101110001110111010110111101001001110101010111110101101101110100111101001101101001110011110111001111000011110101111111001110101101110011110001110101110001000111010111000111011101011011110100100111010101011111010110110111010011110100110110100111001111011100111100001 ebf9d6e78eb88eb8eeb7a4eabeb6e9e9b4e7b9e1ebf9d6e78eb88eb8eeb7a4eabeb6e9e9b4e7b9e1
UTF-8 譎帛クク邨り抄蜑雁香譎帛クク邨り抄蜑雁香 111010001010110110001110111001011011100010011011111011111011110110111000111011111011110110111000111010011000001010101000111000111000001010001010111001101000101010000100111010001001110010010001111010011001101110000001111010011010011010011001111010001010110110001110111001011011100010011011111011111011110110111000111011111011110110111000111010011000001010101000111000111000001010001010111001101000101010000100111010001001110010010001111010011001101110000001111010011010011010011001 e8ad8ee5b89befbdb8efbdb8e982a8e3828ae68a84e89c91e99b81e9a699e8ad8ee5b89befbdb8efbdb8e982a8e3828ae68a84e89c91e99b81e9a699
UHC 譎帛??邨り抄?雁香譎帛??邨り抄?雁香 11111101110100101101101111011001001111110011111111110101101111101010101011101010111101001111110000111111111001001101001011111010110001011111110111010010110110111101100100111111001111111111010110111110101010101110101011110100111111000011111111100100110100101111101011000101 fdd2dbd93f3ff5beaaeaf4fc3fe4d2fac5fdd2dbd93f3ff5beaaeaf4fc3fe4d2fac5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)