To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 姨??姨??恁???????????夷??^ 1001101101001000001111110011111110011011010010000011111100111111100111001000110000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000100011001110001111110011111101011110 9b483f3f9b483f3f9c8c3f3f3f3f3f3f3f3f3f3f3f88ce3f3f5e
EUC-JP 姨??姨??恁???????????夷??^ 1101010110101001001111110011111111010101101010010011111100111111110101111110110000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000011010000001111110011111101011110 d5a93f3fd5a93f3fd7ec3f3f3f3f3f3f3f3f3f3f3fb0d03f3f5e
UTF-8 姨뚰슀姨뚯쭦恁㏃쮼吏뺤찈淋볦찈吏싴삨夷섏쭠^ 11100101101001111010100011101011100110101011000011101100100010101000000011100101101001111010100011101011100110101010111111101100101011011010011011100110100000011000000111100011100011111000001111101100101011101011110011101111101001111001111011101011101110101010010011101100101100001000100011101111101001111011010111101011101100111010011011101100101100001000100011101111101001111001111011101100100010111011010011101100100000101010100011100101101001001011011111101100100001001000111111101100101011011010000001011110 e5a7a8eb9ab0ec8a80e5a7a8eb9aafecada6e68181e38f83ecaebcefa79eebbaa4ecb088efa7b5ebb3a6ecb088efa79eec8bb4ec82a8e5a4b7ec848fecada05e
UHC 姨뚰슀姨뚯쭦恁㏃쮼吏뺤찈淋볦찈吏싴삨夷섏쭠^ 11101100101010011000110011101101100110101001001111101100101010011000110011101100101001111001101011101100111101101010011111101100101010001001100011101100101001111001010111101100101010011000110011101100111110001001001111101100101010011000110011101100101001111001101011101101100110001010011111101100101010001001100011101100101001111001010101011110 eca98ced9a93eca98ceca79aecf6a7eca898eca795eca98cecf893eca98ceca79aed98a7eca898eca7955e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)