To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??怡??幼??筌??肄▼?癒??筌??肄▼? 11101001011001100011111100111111100111000111110100111111001111111001011101100011001111110011111111100010101000110011111100111111111000111110010110000001101001010011111110010110111111000011111100111111111000101010001100111111001111111110001111100101100000011010010100111111 e9663f3f9c7d3f3f97633f3fe2a33f3fe3e581a53f96fc3f3fe2a33f3fe3e581a53f
EUC-JP 馭??怡??幼??筌™?肄▼?癒??筌™?肄▼? 1111000111000111001111110011111111010111110111100011111100111111110011011100010000111111001111111110010010100101100011111010001011101111001111111110011011100111101000101010011100111111110011001111111000111111001111111110010010100101100011111010001011101111001111111110011011100111101000101010011100111111 f1c73f3fd7de3f3fcdc43f3fe4a58fa2ef3fe6e7a2a73fccfe3f3fe4a58fa2ef3fe6e7a2a73f
UTF-8 馭곥룂怡곤㎗幼꾩뒍筌™뫁肄▼㎤癒곗뒗筌™뫁肄▼㎤ 111010011010011010101101111010101011001110100101111010111010001110000010111001101000000010100001111010101011001110100100111000111000111010010111111001011011100110111100111010101011111010101001111010111001001010001101111001111010110110001100111000101000010010100010111010111010101110000001111010001000001010000100111000101001011010111100111000111000111010100100111001111001100110010010111010101011001110010111111010111001001010010111111001111010110110001100111000101000010010100010111010111010101110000001111010001000001010000100111000101001011010111100111000111000111010100100 e9a6adeab3a5eba382e680a1eab3a4e38e97e5b9bceabea9eb928de7ad8ce284a2ebab81e88284e296bce38ea4e79992eab397eb9297e7ad8ce284a2ebab81e88284e296bce38ea4
UHC 馭곥룂怡곤㎗幼꾩뒍筌™뫁肄▼㎤癒곗뒗筌™뫁肄▼㎤ 111001011101111110000001111000111000111110000011111011001010111010110000111011111010011110100011111010101110101010000100111011001000101010001010111011111010011110100010111000101001000110100101111011001011110110100001111001011010011110101000111010111010100010110000111011001000101010010100111011111010011110100010111000101001000110100101111011001011110110100001111001011010011110101000 e5df81e38f83ecaeb0efa7a3eaea84ec8a8aefa7a2e291a5ecbda1e5a7a8eba8b0ec8a94efa7a2e291a5ecbda1e5a7a8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)