To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??筌??泣??遺??筌≪????循?? 11101001011001100011111100111111111000101010001100111111001111111000101110000011001111110011111110001000111000100011111100111111111000101010001110000001111000010011111100111111001111110011111110001111011110100011111100111111 e9663f3fe2a33f3f8b833f3f88e23f3fe2a381e13f3f3f3f8f7a3f3f
EUC-JP 馭??筌??泣??遺??筌≪????循?? 11110001110001110011111100111111111001001010010100111111001111111011010111100011001111110011111110110000111001000011111100111111111001001010010110100010111000110011111100111111001111110011111110111101110110110011111100111111 f1c73f3fe4a53f3fb5e33f3fb0e43f3fe4a5a2e33f3f3f3fbddb3f3f
UTF-8 馭곸퍞筌듐굧泣낉㎖遺우퐷筌≪눘李볩㏊循띿뒠 111010011010011010101101111010101011001110111000111011011000110110011110111001111010110110001100111010111001001110010000111010101011010110100111111001101011001110100011111010111000001010001001111000111000111010010110111010011000000110111010111011001001101010110000111011011001000010110111111001111010110110001100111000101000100110101010111010111000100010011000111011111010011110100001111010111011001110101001111000111000111110001010111001011011111010101010111010111001110110111111111010111001001010100000 e9a6adeab3b8ed8d9ee7ad8ceb9390eab5a7e6b3a3eb8289e38e96e981baec9ab0ed90b7e7ad8ce289aaeb8898efa7a1ebb3a9e38f8ae5beaaeb9dbfeb92a0
UHC 馭곸퍞筌듐굧泣낉㎖遺우퐷筌≪눘李볩㏊循띿뒠 111001011101111110000001111011001011101110010101111011111010011110110101111000111000001010001101111010111110100010000101111011111010011110100010111010111011011010111111111011001011110110100000111011111010011110100001111011001000011110110001111011001011000010010011111011111010011110110101111000101110000010001101111011001000101010011100 e5df81ecbb95efa7b5e3828debe885efa7a2ebb6bfecbda0efa7a1ec87b1ecb093efa7b5e2e08dec8a9c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)