To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壤??鎰??魏??永????????永??鎰? 1001101011011111001111110011111111101000010011000011111100111111111010011011000000111111001111111000100101101001001111110011111100111111001111110011111100111111001111110011111110001001011010010011111100111111111010000100110000111111 9adf3f3fe84c3f3fe9b03f3f89693f3f3f3f3f3f3f3f89693f3fe84c3f
EUC-JP 壤??鎰??魏??永????????永??鎰? 1101010011100001001111110011111111101111101011010011111100111111111100101011001000111111001111111011000111001010001111110011111100111111001111110011111100111111001111110011111110110001110010100011111100111111111011111010110100111111 d4e13f3fefad3f3ff2b23f3fb1ca3f3f3f3f3f3f3f3fb1ca3f3fefad3f
UTF-8 壤깆쥜鎰믥독魏됲뭽永띕봺罹숂독戮논뫕永띕쵓鎰갃 111001011010001110100100111010101011100110000110111011001010010110011100111010011000111010110000111010111010111110100101111010111000111110000101111010011010110110001111111010111001000010110010111010111010110110111101111001101011000010111000111010111001110110010101111010111011010010111010111011111010011110100110111011001000100010000010111010111000111110000101111011111010011110010010111010111000010110111100111010111010101110010101111001101011000010111000111010111001110110010101111011001011010110010011111010011000111010110000111010101011000010000011 e5a3a4eab986eca59ce98eb0ebafa5eb8f85e9ad8feb90b2ebadbde6b0b8eb9d95ebb4baefa7a6ec8882eb8f85efa792eb85bcebab95e6b0b8eb9d95ecb593e98eb0eab083
UHC 壤깆쥜鎰믥독魏됲뭽永띕봺罹숂독戮논뫕永띕쵓鎰갃 11100101101111011011000111101100101000101001000111101100111100001001001011100111101101011011011011101010111000001000100111101101100100101000110011100111101101011011011011101011100101001000000111101100101110101001100111100111101101011011011011101011101111011011001111101101100100011011011111100111101101011011011011101011101011001001010111101100111100001000000101000010 e5bdb1eca291ecf092e7b5b6eae089ed928ce7b5b6eb9481ecba99e7b5b6ebbdb3ed91b7e7b5b6ebac95ecf08142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)