To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 劑?乙陌躪齊??魄劑?乙陌躪齊??白^ 10011001100111010011111110001001101100111110100010011001111001110101100011101010100011100011111100111111111010011010111010011001100111010011111110001001101100111110100010011001111001110101100011101010100011100011111100111111100101001001001001011110 999d3f89b3e899e758ea8e3f3fe9ae999d3f89b3e899e758ea8e3f3f94925e
EUC-JP 劑?乙陌躪齊??魄劑?乙陌躪齊??白^ 11010001111111010011111110110010101101011110111111111001111011011011100111110011111011100011111100111111111100101011000011010001111111010011111110110010101101011110111111111001111011011011100111110011111011100011111100111111110001111111001001011110 d1fd3fb2b5eff9edb9f3ee3f3ff2b0d1fd3fb2b5eff9edb9f3ee3f3fc7f25e
UTF-8 劑렓乙陌躪齊곁렱魄劑렓乙陌躪齊곁렱白^ 11100101100010101001000111101011101000001001001111100100101110011001100111101001100110011000110011101000101110101010101011101001101111011000101011101010101100111000000111101011101000001011000111101001101011011000010011100101100010101001000111101011101000001001001111100100101110011001100111101001100110011000110011101000101110101010101011101001101111011000101011101010101100111000000111101011101000001011000111100111100110011011110101011110 e58a91eba093e4b999e9998ce8baaae9bd8aeab381eba0b1e9ad84e58a91eba093e4b999e9998ce8baaae9bd8aeab381eba0b1e799bd5e
UHC 劑렓乙陌躪齊곁렱魄劑렓乙陌躪齊곁렱白^ 11110000101001011000111010101000111010111110000011011000111010001101011111110101111100001011101010110000111001111000111010111110110110111101111011110000101001011000111010101000111010111110000011011000111010001101011111110101111100001011101010110000111001111000111010111110110110111101110001011110 f0a58ea8ebe0d8e8d7f5f0bab0e78ebedbdef0a58ea8ebe0d8e8d7f5f0bab0e78ebedbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)