To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 弱??將??絶??絶??營??節わ?厓э?B 10001110111000110011111100111111100110111001001000111111001111111001000011100010001111110011111110010000111000100011111100111111100110100111101000111111001111111001000011011111100000101110110100111111111110101000110110000100100011110011111101000010 8ee33f3f9b923f3f90e23f3f90e23f3f9a7a3f3f90df82ed3ffa8d848f3f42
EUC-JP 弱??將??絶??絶??營??節わ?厓э?B 1011110011100101001111110011111111010101111100100011111100111111110000001110010000111111001111111100000011100100001111110011111111010011110110110011111100111111110000001110000110100100111011110011111110001111101101001100011110100111111011110011111101000010 bce53f3fd5f23f3fc0e43f3fc0e43f3fd3db3f3fc0e1a4ef3f8fb4c7a7ef3f42
UTF-8 弱녺퐤將딉쉠絶먨룷絶껃컛營랂꼯節わ풊厓э푵B 111001011011110010110001111010111000010110111010111011011001000010100100111001011011000010000111111010111001010010001001111011001000100110100000111001111011010110110110111010111010100010101000111010111010001110110111111001111011010110110110111010101011101110000011111011001011101110011011111001111000011110011111111010111001111010000010111010101011110010101111111001111010111110000000111000111000001010001111111011011001001010001010111001011000111010010011110100011000110111101101100100011011010101000010 e5bcb1eb85baed90a4e5b087eb9489ec89a0e7b5b6eba8a8eba3b7e7b5b6eabb83ecbb9be7879feb9e82eabcafe7af80e3828fed928ae58e93d18ded91b542
UHC 弱녺퐤將딉쉠絶먨룷絶껃컛營랂꼯節わ풊厓э푵B 11100101101100001000011011100111101111011000110111101101111000101000101011101111101111011010101011101111101111101001000011100101100011111010110011101111101111101000001111100101101100001000011011100111101111011000110111101110100001001000101011101111101111011010101011101111101111101001000011100100111011011010110011101111101111101000001101000010 e5b086e7bd8dede28aefbdaaefbe90e58facefbe83e5b086e7bd8dee848aefbdaaefbe90e4edacefbe8342

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
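
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the converter's own implementation: the function name `encode_in_charsets` and the mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the 3f bytes in the table above suggest the converter does as well.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # SJIS-Win = CP932
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # Unified Hangul Code
]


def encode_in_charsets(text):
    """Return a list of (label, binary string, hex string) rows for text."""
    rows = []
    for label, codec in CHARSETS:
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_in_charsets("あB"):
        print(label, bits, hex_string)
```

Each row holds the charset label, the encoded bytes as a string of 0s and 1s (eight per byte), and the same bytes in lowercase hexadecimal.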