To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??艤??擬??永?????醫??永??爾 1110001110100000001111110011111111100100011111100011111100111111100010110101101100111111001111111000100101101001001111110011111100111111001111110011111111100111110011100011111100111111100010010110100100111111001111111000111010100010 e3a03f3fe47e3f3f8b5b3f3f89693f3f3f3f3fe7ce3f3f89693f3f8ea2
EUC-JP 罌??艤??擬??永??嫄??醫??永??爾 11100110101000100011111100111111111001111101111100111111001111111011010110111100001111110011111110110001110010100011111100111111100011111011101010100001001111110011111111101110110100000011111100111111101100011100101000111111001111111011110010100100 e6a23f3fe7df3f3fb5bc3f3fb1ca3f3f8fbaa13f3feed03f3fb1ca3f3fbca4
UTF-8 罌삘뮪艤욕젽擬듭뒡永띔쑥嫄꿰럦醫묓뮏永띔퍌爾 111001111011110110001100111011001000001010011000111010111010111010101010111010001000100110100100111011001001101010010101111011001010000010111101111001101001001110101100111010111001001110101101111010111001001010100001111001101011000010111000111010111001110110010100111011001001000110100101111001011010101110000100111010101011111110110000111010111001111110100110111010011000011010101011111010111010110010010011111010111010111010001111111001101011000010111000111010111001110110010100111011011000110110001100111001111000100010111110 e7bd8cec8298ebaeaae889a4ec9a95eca0bde693aceb93adeb92a1e6b0b8eb9d94ec91a5e5ab84eabfb0eb9fa6e986abebac93ebae8fe6b0b8eb9d94ed8d8ce788be
UHC 罌삘뮪艤욕젽擬듭뒡永띔쑥嫄꿰럦醫묓뮏永띔퍌爾 1110010110100010101110111110001010010010101101001110101111111010101111111110010110100000101011111110101111110100101101011110110010001010100111011110011110110101101101101110101010111110101001101110101010110001101100101110011110001110100010011110110010100010100100011110110110010010100111001110011110110101101101101110101010111011100000111110110010110011 e5a2bbe292b4ebfabfe5a0afebf4b5ec8a9de7b5b6eabea6eab1b2e78e89eca291ed929ce7b5b6eabb83ecb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)