To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 沃???ц?押??瘟????ぐ癌??瘟?? 10010111100000000011111100111111001111111000010010001000001111111000100110011111001111110011111111100001100010010011111100111111001111110011111110000010101011101000101011100000001111110011111111100001100010010011111100111111 97803f3f3f84883f899f3f3fe1893f3f3f3f82ae8ae03f3fe1893f3f
EUC-JP 沃???ц?押??瘟????ぐ癌??瘟?? 11001101111000000011111100111111001111111010011111101000001111111011001010100001001111110011111111100001111010010011111100111111001111110011111110100100101100001011010011100010001111110011111111100001111010010011111100111111 cde03f3f3fa7e83fb2a13f3fe1e93f3f3f3fa4b0b4e23f3fe1e93f3f
UTF-8 沃겼겢歷ц갬押뜹컟瘟룩큹呂묋ぐ癌닸릍瘟룡릍 1110011010110010100000111110101010110010101111001110101010110010101000101110111110100110100011001101000110000110111010101011000010101100111001101000101010111100111010111001110010111001111011001011101110011111111001111001100010011111111010111010001110101001111011011000000110111001111011111010011010000000111010111010110010001011111000111000000110010000111001111001100110001100111010111000101110111000111010111010011010001101111001111001100010011111111010111010001110100001111010111010011010001101 e6b283eab2bceab2a2efa68cd186eab0ace68abceb9cb9ecbb9fe7989feba3a9ed81b9efa680ebac8be38190e7998ceb8bb8eba68de7989feba3a1eba68d
UHC 沃겼겢歷ц갬押뜹컟瘟룩큹呂묋ぐ癌닸릍瘟룡릍 111010001010101010110000111001011000000110110100111001101011100010101100111010001011000010110111111001001110001110110110111001011011000010001010111010001011000010110111111010001011010010001000111001011111101110010001111010001010101010110000111001001101111110110100111001101011100010101100111010001011000010110111111001101011100010101100 e8aab0e581b4e6b8ace8b0b7e4e3b6e5b08ae8b0b7e8b488e5fb91e8aab0e4dfb4e6b8ace8b0b7e6b8ac

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
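
A conversion like the one in the table can be sketched in Python as follows. This is only an illustration, not the page's actual implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, as is the use of `errors="replace"` to turn unencodable characters into `?` (byte 3f), which is what the ISO-8859-1 row appears to show. The helper name `encode_row` is hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, binary bit string, hex bit string)."""
    # Characters the charset cannot represent become "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in raw)
    shown = raw.decode(codec, errors="replace")
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "ぐ"  # hiragana GU, used here only as a sample input
    print("Charset Character Bit string (binary) Bit string (hexadecimal)")
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, shown, bits, hexa)
```

With the sample input above, the SJIS-WIN, EUC-JP and UTF-8 rows give 82ae, a4b0 and e38190, the same byte sequences that appear for that character in the table, while ISO-8859-1 falls back to 3f.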