To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 穩??傲??猥??}穩??傲??猥??{^ 111000100111001000111111001111111001100011111100001111110011111111100000110011100011111100111111011111011110001001110010001111110011111110011000111111000011111100111111111000001100111000111111001111110111101101011110 e2723f3f98fc3f3fe0ce3f3f7de2723f3f98fc3f3fe0ce3f3f7b5e
EUC-JP 穩??傲??猥??}穩??傲??猥??{^ 111000111101001100111111001111111101000011111110001111110011111111100000110100000011111100111111011111011110001111010011001111110011111111010000111111100011111100111111111000001101000000111111001111110111101101011110 e3d33f3fd0fe3f3fe0d03f3f7de3d33f3fd0fe3f3fe0d03f3f7b5e
UTF-8 穩먨ㄵ傲긺맏猥롥닂}穩먨ㄵ傲긺맏猥롥닂{^ 111001111010100110101001111010111010100010101000111000111000010010110101111001011000001010110010111010101011100010111010111010111010011110001111111001111000110010100101111010111010000110100101111010111000101110000010011111011110011110101001101010011110101110101000101010001110001110000100101101011110010110000010101100101110101010111000101110101110101110100111100011111110011110001100101001011110101110100001101001011110101110001011100000100111101101011110 e7a9a9eba8a8e384b5e582b2eab8baeba78fe78ca5eba1a5eb8b827de7a9a9eba8a8e384b5e582b2eab8baeba78fe78ca5eba1a5eb8b827b5e
UHC 穩먨ㄵ傲긺맏猥롥닂}穩먨ㄵ傲긺맏猥롥닂{^ 111010001011000110010000111001011010010010100101111001111110110010110001111001111011100010111010111010001110010110001110111001011000100010001011011111011110100010110001100100001110010110100100101001011110011111101100101100011110011110111000101110101110100011100101100011101110010110001000100010110111101101011110 e8b190e5a4a5e7ecb1e7b8bae8e58ee5888b7de8b190e5a4a5e7ecb1e7b8bae8e58ee5888b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)