What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????彦?愈輕臂????彦?愈輕臂B 001111110011111100111111001111111001010101000110001111111001011011111010111001110110101011100100010111010011111100111111001111110011111110010101010001100011111110010110111110101110011101101010111001000101110101000010 3f3f3f3f95463f96fae76ae45d3f3f3f3f95463f96fae76ae45d42
EUC-JP 嫄???彦?愈輕臂嫄???彦?愈輕臂B 10001111101110101010000100111111001111110011111111001001101001110011111111001100111111001110110111001011111001111011111010001111101110101010000100111111001111110011111111001001101001110011111111001100111111001110110111001011111001111011111001000010 8fbaa13f3f3fc9a73fccfcedcbe7be8fbaa13f3f3fc9a73fccfcedcbe7be42
UTF-8 嫄렎롈렮彦렊愈輕臂嫄렎롈렮彦렊愈輕臂B 11100101101010111000010011101011101000001000111011101011101000011000100011101011101000001010111011100101101111011010011011101011101000001000101011100110100001001000100011101000101111001001010111101000100001111000001011100101101010111000010011101011101000001000111011101011101000011000100011101011101000001010111011100101101111011010011011101011101000001000101011100110100001001000100011101000101111001001010111101000100001111000001001000010 e5ab84eba08eeba188eba0aee5bda6eba08ae68488e8bc95e88782e5ab84eba08eeba188eba0aee5bda6eba08ae68488e8bc95e8878242
UHC 嫄렎롈렮彦렊愈輕臂嫄렎롈렮彦렊愈輕臂B 11101010101100011000111010100100100011101100111010001110101110111110010111101001100011101010000111101010111011111100110011101110110111101010001011101010101100011000111010100100100011101100111010001110101110111110010111101001100011101010000111101010111011111100110011101110110111101010001001000010 eab18ea48ece8ebbe5e98ea1eaefcceedea2eab18ea48ece8ebbe5e98ea1eaefcceedea242

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
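
The conversion shown in the table can be approximated with a short script. This is a minimal sketch, not the tool's actual implementation: the function name `encode_in_charsets` is hypothetical, and the mapping from the tool's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table, but the tool's mapping tables may differ from Python's for some characters.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render each byte as 8 binary digits, most significant bit first.
        bits = "".join(format(byte, "08b") for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_in_charsets("彦B").items():
        print(label, bits, hexstr)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.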