To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥?????濡??壬??濡??嚥??猷??^ 1001101010001011001111110011111100111111001111110011111110010100010001110011111110000001010010001001000001110000001111110011111110010100010001110011111100111111100110101000101100111111001111111001011101010001001111110011111101011110 9a8b3f3f3f3f3f94473f814890703f3f94473f3f9a8b3f3f97513f3f5e
EUC-JP 嚥?????濡??壬??濡??嚥??猷??^ 1101001111101011001111110011111100111111001111110011111111000111101010000011111110100001101010011011111111010001001111110011111111000111101010000011111100111111110100111110101100111111001111111100110110110010001111110011111101011110 d3eb3f3f3f3f3fc7a83fa1a9bfd13f3fc7a83f3fd3eb3f3fcdb23f3f5e
UTF-8 嚥좊죿溜길퓼濡쀫?壬쇗퓼濡쀫젿嚥좊죿猷욱빑^ 11100101100110101010010111101100101000101000101011101100101000111011111111101111101001111000101111101010101110001011100011101101100100111011110011100110101111111010000111101100100000001010101111101111101111001001111111100101101000111010110011101100100001111001011111101101100100111011110011100110101111111010000111101100100000001010101111101100101000001011111111100101100110101010010111101100101000101000101011101100101000111011111111100111100011001011011111101100100110101011000111101011101110011001000101011110 e59aa5eca28aeca3bfefa78beab8b8ed93bce6bfa1ec80abefbc9fe5a3acec8797ed93bce6bfa1ec80abeca0bfe59aa5eca28aeca3bfe78cb7ec9ab1ebb9915e
UHC 嚥좊죿溜길퓼濡쀫?壬쇗퓼濡쀫젿嚥좊죿猷욱빑^ 11100110101111111010000011101011101000011001011111101010111111101011000111100110101111111010000011101011101000011001011111101011101000111011111111101100111100111011110011100110101111111010000011101011101000011001011111101011101000001011000111100110101111111010000011101011101000011001011111101011101000111011111111101101100101011011010101011110 e6bfa0eba197eafeb1e6bfa0eba197eba3bfecf3bce6bfa0eba197eba0b1e6bfa0eba197eba3bfed95b55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)