What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ?り?泣??誘??v?り?泣??誘??vB 001111111000001011101000001111111000101110000011001111110011111110010111010101010011111100111111011101100011111110000010111010000011111110001011100000110011111100111111100101110101010100111111001111110111011001000010 3f82e83f8b833f3f97553f3f763f82e83f8b833f3f97553f3f7642
EUC-JP ?り?泣??誘??v?り?泣??誘??vB 001111111010010011101010001111111011010111100011001111110011111111001101101101100011111100111111011101100011111110100100111010100011111110110101111000110011111100111111110011011011011000111111001111110111011001000010 3fa4ea3fb5e33f3fcdb63f3f763fa4ea3fb5e33f3fcdb63f3f7642
UTF-8 閭り내泣섇쮦誘ㅼ삒v閭り내泣섇쮦誘ㅼ삒vB 111011111010011010000110111000111000001010001010111010111000001010110100111001101011001110100011111011001000010010000111111011001010111010100110111010001010101010011000111000111000010110111100111011001000001010010010011101101110111110100110100001101110001110000010100010101110101110000010101101001110011010110011101000111110110010000100100001111110110010101110101001101110100010101010100110001110001110000101101111001110110010000010100100100111011001000010 efa686e3828aeb82b4e6b3a3ec8487ecaea6e8aa98e385bcec829276efa686e3828aeb82b4e6b3a3ec8487ecaea6e8aa98e385bcec82927642
UHC 閭り내泣섇쮦誘ㅼ삒v閭り내泣섇쮦誘ㅼ삒vB 111001101010110110101010111010101011001110111011111010111110100010011000111001011010100010000011111010111010111110100100111011001001100010010111011101101110011010101101101010101110101010110011101110111110101111101000100110001110010110101000100000111110101110101111101001001110110010011000100101110111011001000010 e6adaaeab3bbebe898e5a883ebafa4ec989776e6adaaeab3bbebe898e5a883ebafa4ec98977642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
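
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names are assumptions about how each charset label maps onto Python's standard codecs (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC), and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (3f), which matches the 3f bytes seen in the table.

```python
# Map the charset labels used in the table to Python codec names.
# These mappings are assumptions, not taken from the tool itself.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("vB").items():
        print(label, bits, hex_string)
```

For the plain ASCII input "vB", every charset in this sketch should yield the same two bytes, 76 and 42, because all five encodings are ASCII-compatible in that range.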