What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????W}??????W{^ 0011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f577d3f3f3f3f3f3f577b5e
SJIS-WIN 鏑孟??沮?W}鏑孟??沮?W{^ 1001001101001100100101101101000000111111001111111001111110011100001111110101011101111101100100110100110010010110110100000011111100111111100111111001110000111111010101110111101101011110 934c96d03f3f9f9c3f577d934c96d03f3f9f9c3f577b5e
EUC-JP 鏑孟??沮?W}鏑孟??沮?W{^ 1100010110101101110011001101001000111111001111111101110111111100001111110101011101111101110001011010110111001100110100100011111100111111110111011111110000111111010101110111101101011110 c5adccd23f3fddfc3f577dc5adccd23f3fddfc3f577b5e
UTF-8 鏑孟렟렫沮촙W}鏑孟렟렫沮촙W{^ 1110100110001111100100011110010110101101100111111110101110100000100111111110101110100000101010111110011010110010101011101110110010110100100110010101011101111101111010011000111110010001111001011010110110011111111010111010000010011111111010111010000010101011111001101011001010101110111011001011010010011001010101110111101101011110 e98f91e5ad9feba09feba0abe6b2aeecb499577de98f91e5ad9feba09feba0abe6b2aeecb499577b5e
UHC 鏑孟렟렫沮촙W}鏑孟렟렫沮촙W{^ 1110111011101011110110001110101110001110101100001000111010111001111011101100000111000011110011110101011101111101111011101110101111011000111010111000111010110000100011101011100111101110110000011100001111001111010101110111101101011110 eeebd8eb8eb08eb9eec1c3cf577deeebd8eb8eb08eb9eec1c3cf577b5e

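SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (byte 3f) in that charset's row.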
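The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the converter's actual implementation: the function names `to_bits` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Unencodable characters are replaced with "?" (3f), which matches the behaviour visible in the table, but individual byte mappings may still differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {name: to_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    sample = "W}"
    for name, (binary, hexadecimal) in convert(sample).items():
        print(name, sample, binary, hexadecimal)
```

For the ASCII sample "W}", every charset yields the same bytes (577d), because all five encodings agree on the ASCII range. Differences only appear for characters outside it.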