To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??裕??宋??烏?????諛??娃??旬 1110001010100011001111110011111110010111010101000011111100111111100100010111011000111111001111111000100101000111001111110011111100111111001111110011111111100110100001110011111100111111100010001010000100111111001111111000111101111011 e2a33f3f97543f3f91763f3f89473f3f3f3f3fe6873f3f88a13f3f8f7b
EUC-JP 筌??裕??宋??烏?????諛??娃??旬 1110010010100101001111110011111111001101101101010011111100111111110000011101011100111111001111111011000110101000001111110011111100111111001111110011111111101011111001110011111100111111101100001010001100111111001111111011110111011100 e4a53f3fcdb53f3fc1d73f3fb1a83f3f3f3f3febe73f3fb0a33f3fbddc
UTF-8 筌뚯뼍裕뺝뿗宋믩꽢烏띻퀋劉길퓴諛⑸쭫娃븐눨旬 111001111010110110001100111010111001101010101111111010111011110010001101111010001010001110010101111010111011101010011101111010111011111110010111111001011010111010001011111010111010111110101001111010101011110110100010111001111000001110001111111010111001110110111011111011011000000010001011111011111010011110000111111010101011100010111000111011011001001110110100111010001010101110011011111000101001000110111000111011001010110110101011111001011010100010000011111010111011100010010000111010111000100010101000111001101001011110101100 e7ad8ceb9aafebbc8de8a395ebba9debbf97e5ae8bebafa9eabda2e7838feb9dbbed808befa787eab8b8ed93b4e8ab9be291b8ecadabe5a883ebb890eb88a8e697ac
UHC 筌뚯뼍裕뺝뿗宋믩꽢烏띻퀋劉길퓴諛⑸쭫娃븐눨旬 1110111110100111100011001110110010010110100101011110101110101110100101011110010110010111100110101110000111100100100100101110101110000100101011111110100010100001100011011110101010110011100000011110101011100101101100011110011010111111100110101110101110110000101010011110101110100111100111111110100011011111101110101110110010000111101111111110001011100010 efa78cec9695ebae95e5979ae1e492eb84afe8a18deab381eae5b1e6bf9aebb0a9eba79fe8dfbaec87bfe2e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)