What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???姨????????????????B 00111111001111110011111110011011010010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f9b483f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ???姨???????ŀ????????B 001111110011111100111111110101011010100100111111001111110011111100111111001111110011111100111111100011111010100111001001001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3fd5a93f3f3f3f3f3f3f8fa9c93f3f3f3f3f3f3f3f42
UTF-8 溜삠뀛姨ⓥ뵗溜깅졎栒붿ŀ栒붾젨溜삠뀛梨뾗B 111011111010011110001011111011001000001010100000111010111000000010011011111001011010011110101000111000101001001110100101111010111011010110010111111011111010011110001011111010101011100110000101111011001010000110001110111001101010000010010010111010111011011010111111110001011000000011100110101000001001001011101011101101101011111011101100101000001010100011101111101001111000101111101100100000101010000011101011100000001001101111101111101001111010001011101011101111101001011101000010 efa78bec82a0eb809be5a7a8e293a5ebb597efa78beab985eca18ee6a092ebb6bfc580e6a092ebb6beeca0a8efa78bec82a0eb809befa7a2ebbe9742
UHC 溜삠뀛姨ⓥ뵗溜깅졎栒붿ŀ栒붾젨溜삠뀛梨뾗B 1110101011111110101110111110001110000101100101001110110010101001101010001110001010010100100110011110101011111110101100011110101110100000101110111110001011100011100101001110110010101001101010001110001011100011100101001110101110100000101000001110101011111110101110111110001110000101100101001110110010110001100101110101010001000010 eafebbe38594eca9a8e29499eafeb1eba0bbe2e394eca9a8e2e394eba0a0eafebbe38594ecb1975442

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
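
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name encode_row and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is assumed to be what produces the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


# Print one row per charset for a sample input.
for label, codec in CHARSETS.items():
    bits, hex_string = encode_row("B", codec)
    print(label, "B", bits, hex_string)
```

All five charsets are ASCII-compatible, so the letter "B" comes out as the single byte 42 (binary 01000010) in every row, matching the last byte of each row in the table above.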