To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 畏??泣η????v畏??泣η????vB 100010001101100000111111001111111000101110000011100000111100010100111111001111110011111100111111011101101000100011011000001111110011111110001011100000111000001111000101001111110011111100111111001111110111011001000010 88d83f3f8b8383c53f3f3f3f7688d83f3f8b8383c53f3f3f3f7642
EUC-JP 畏??泣η????v畏??泣η????vB 101100001101101000111111001111111011010111100011101001101100011100111111001111110011111100111111011101101011000011011010001111110011111110110101111000111010011011000111001111110011111100111111001111110111011001000010 b0da3f3fb5e3a6c73f3f3f3f76b0da3f3fb5e3a6c73f3f3f3f7642
UTF-8 畏븍맮泣η툞流껋돩v畏븍맮泣η툞流껋돩vB 11100111100101011000111111101011101110001000110111101011101001111010111011100110101100111010001111001110101101111110110110001000100111101110111110100111100010101110101010111011100010111110101110001111101010010111011011100111100101011000111111101011101110001000110111101011101001111010111011100110101100111010001111001110101101111110110110001000100111101110111110100111100010101110101010111011100010111110101110001111101010010111011001000010 e7958febb88deba7aee6b3a3ceb7ed889eefa78aeabb8beb8fa976e7958febb88deba7aee6b3a3ceb7ed889eefa78aeabb8beb8fa97642
UHC 畏븍맮泣η툞流껋돩v畏븍맮泣η툞流껋돩vB 111010001110011010111010111010111001000010110101111010111110100010100101111001111011100010010101111010101111110010000011111011001000100110101100011101101110100011100110101110101110101110010000101101011110101111101000101001011110011110111000100101011110101011111100100000111110110010001001101011000111011001000010 e8e6baeb90b5ebe8a5e7b895eafc83ec89ac76e8e6baeb90b5ebe8a5e7b895eafc83ec89ac7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)