To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?夭?瀕??????枇申?????瀕???^ 001111111001101011101110001111111001010101101101001111110011111100111111001111110011111100111111100101001111100010010000010111000011111100111111001111110011111100111111100101010110110100111111001111110011111101011110 3f9aee3f956d3f3f3f3f3f3f94f8905c3f3f3f3f3f956d3f3f3f5e
EUC-JP ?夭?瀕??????枇申?????瀕???^ 001111111101010011110000001111111100100111001110001111110011111100111111001111110011111100111111110010001111101010111111101111010011111100111111001111110011111100111111110010011100111000111111001111110011111101011110 3fd4f03fc9ce3f3f3f3f3f3fc8fabfbd3f3f3f3f3fc9ce3f3f3f5e
UTF-8 뤵夭툘瀕렋렣뤰탮죳죳枇申렪뤰탮쥙킃瀕렊렊롘^ 11101011101001001011010111100101101001001010110111101101100010001001100011100111100000001001010111101011101000001000101111101011101000001010001111101011101001001011000011101101100000111010111011101100101000111011001111101100101000111011001111100110100111101000011111100111100101001011001111101011101000001010101011101011101001001011000011101101100000111010111011101100101001011001100111101101100000101000001111100111100000001001010111101011101000001000101011101011101000001000101011101011101000011001100001011110 eba4b5e5a4aded8898e78095eba08beba0a3eba4b0ed83aeeca3b3eca3b3e69e87e794b3eba0aaeba4b0ed83aeeca599ed8283e78095eba08aeba08aeba1985e
UHC 뤵夭툘瀕렋렣뤰탮죳죳枇申렪뤰탮쥙킃瀕렊렊롘^ 10001111111000111110100011101100101110001000111111011110101101011000111010100010100011101011010010001111110111101011010110001110101000011000111010100001100011101101110111101101111000111110100110001110101110001000111111011110101101011000111010100010100011101011010010001111110111101011010110001110101000011000111010100001100011101101110001011110 8fe3e8ecb88fdeb58ea28eb48fdeb58ea18ea18eddede3e98eb88fdeb58ea28eb48fdeb58ea18ea18edc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)