To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^z?????????^zB 0011111100111111001111110011111100111111001111110011111100111111001111110101111001111010001111110011111100111111001111110011111100111111001111110011111100111111010111100111101001000010 3f3f3f3f3f3f3f3f3f5e7a3f3f3f3f3f3f3f3f3f5e7a42
SJIS-WIN ????げ?男??^z????げ?男??^zB 001111110011111100111111001111111000001010110000001111111001001001101010001111110011111101011110011110100011111100111111001111110011111110000010101100000011111110010010011010100011111100111111010111100111101001000010 3f3f3f3f82b03f926a3f3f5e7a3f3f3f3f82b03f926a3f3f5e7a42
EUC-JP ????げ?男??^z????げ?男??^zB 001111110011111100111111001111111010010010110010001111111100001111001011001111110011111101011110011110100011111100111111001111110011111110100100101100100011111111000011110010110011111100111111010111100111101001000010 3f3f3f3fa4b23fc3cb3f3f5e7a3f3f3f3fa4b23fc3cb3f3f5e7a42
UTF-8 룶웡룶첂げ룶男룶웡^z룶웡룶첂げ룶男룶웡^zB 1110101110100011101101101110110010011011101000011110101110100011101101101110110010110010100000101110001110000001100100101110101110100011101101101110011110010100101101111110101110100011101101101110110010011011101000010101111001111010111010111010001110110110111011001001101110100001111010111010001110110110111011001011001010000010111000111000000110010010111010111010001110110110111001111001010010110111111010111010001110110110111011001001101110100001010111100111101001000010 eba3b6ec9ba1eba3b6ecb282e38192eba3b6e794b7eba3b6ec9ba15e7aeba3b6ec9ba1eba3b6ecb282e38192eba3b6e794b7eba3b6ec9ba15e7a42
UHC 룶웡룶첂げ룶男룶웡^z룶웡룶첂げ룶男룶웡^zB 1000111110101011101111111111110110001111101010111010101010001111101010101011001010001111101010111101000111111011100011111010101110111111111111010101111001111010100011111010101110111111111111011000111110101011101010101000111110101010101100101000111110101011110100011111101110001111101010111011111111111101010111100111101001000010 8fabbffd8fabaa8faab28fabd1fb8fabbffd5e7a8fabbffd8fabaa8faab28fabd1fb8fabbffd5e7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)