To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????RF}v????????RF}vB 00111111001111110011111100111111001111110011111100111111001111110101001001000110011111010111011000111111001111110011111100111111001111110011111100111111001111110101001001000110011111010111011001000010 3f3f3f3f3f3f3f3f52467d763f3f3f3f3f3f3f3f52467d7642
SJIS-WIN ?賂??諸???RF}v?賂??諸???RF}vB 0011111110011000010001110011111100111111100011111001010000111111001111110011111101010010010001100111110101110110001111111001100001000111001111110011111110001111100101000011111100111111001111110101001001000110011111010111011001000010 3f98473f3f8f943f3f3f52467d763f98473f3f8f943f3f3f52467d7642
EUC-JP 鋌賂??諸???RF}v鋌賂??諸???RF}vB 100011111110010010111011110011111010100000111111001111111011110111110100001111110011111100111111010100100100011001111101011101101000111111100100101110111100111110101000001111110011111110111101111101000011111100111111001111110101001001000110011111010111011001000010 8fe4bbcfa83f3fbdf43f3f3f52467d768fe4bbcfa83f3fbdf43f3f3f52467d7642
UTF-8 鋌賂렰렡諸쟉렰렮RF}v鋌賂렰렡諸쟉렰렮RF}vB 111010011000101110001100111010001011001110000010111010111010000010110000111010111010000010100001111010001010101110111000111011001001111110001001111010111010000010110000111010111010000010101110010100100100011001111101011101101110100110001011100011001110100010110011100000101110101110100000101100001110101110100000101000011110100010101011101110001110110010011111100010011110101110100000101100001110101110100000101011100101001001000110011111010111011001000010 e98b8ce8b382eba0b0eba0a1e8abb8ec9f89eba0b0eba0ae52467d76e98b8ce8b382eba0b0eba0a1e8abb8ec9f89eba0b0eba0ae52467d7642
UHC 鋌賂렰렡諸쟉렰렮RF}v鋌賂렰렡諸쟉렰렮RF}vB 1110111111111011110101101111000110001110101111011000111010110010111100001011001111000000111100011000111010111101100011101011101101010010010001100111110101110110111011111111101111010110111100011000111010111101100011101011001011110000101100111100000011110001100011101011110110001110101110110101001001000110011111010111011001000010 effbd6f18ebd8eb2f0b3c0f18ebd8ebb52467d76effbd6f18ebd8eb2f0b3c0f18ebd8ebb52467d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)