To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 業??油?????D業??油?????D^ 10001011110001100011111100111111100101101111101100111111001111110011111100111111001111110100010010001011110001100011111100111111100101101111101100111111001111110011111100111111001111110100010001011110 8bc63f3f96fb3f3f3f3f3f448bc63f3f96fb3f3f3f3f3f445e
EUC-JP 業??油??洧??D業??油??洧??D^ 1011011011001000001111110011111111001100111111010011111100111111100011111100011110110100001111110011111101000100101101101100100000111111001111111100110011111101001111110011111110001111110001111011010000111111001111110100010001011110 b6c83f3fccfd3f3f8fc7b43f3f44b6c83f3fccfd3f3f8fc7b43f3f445e
UTF-8 業볥뜈油믥독洧룸젃D業볥뜈油믥독洧룸젃D^ 111001101010010110101101111010111011001110100101111010111001110010001000111001101011001010111001111010111010111110100101111010111000111110000101111001101011010010100111111010111010001110111000111011001010000010000011010001001110011010100101101011011110101110110011101001011110101110011100100010001110011010110010101110011110101110101111101001011110101110001111100001011110011010110100101001111110101110100011101110001110110010100000100000110100010001011110 e6a5adebb3a5eb9c88e6b2b9ebafa5eb8f85e6b4a7eba3b8eca08344e6a5adebb3a5eb9c88e6b2b9ebafa5eb8f85e6b4a7eba3b8eca083445e
UHC 業볥뜈油믥독洧룸젃D業볥뜈油믥독洧룸젃D^ 111001011111011010010011111010111000110110001011111010101111101010010010111001111011010110110110111010101111101110110111111010111010000010000111010001001110010111110110100100111110101110001101100010111110101011111010100100101110011110110101101101101110101011111011101101111110101110100000100001110100010001011110 e5f693eb8d8beafa92e7b5b6eafbb7eba08744e5f693eb8d8beafa92e7b5b6eafbb7eba087445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)