To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 髷ヲ霆ク貉懃明髷ヲ霆ク﨟シ螟ア髷ヲ霆ク 111010011001111010100110111010001011101110111000111001101011100110011100111001111001011010111110111010011001111010100110111010001011101110111000111110111001110110111100111001011010010010110001111010011001111010100110111010001011101110111000 e99ea6e8bbb8e6b99ce796bee99ea6e8bbb8fb9dbce5a4b1e99ea6e8bbb8
EUC-JP 髷ヲ霆ク貉懃明髷ヲ霆ク?シ螟ア髷ヲ霆ク 11110001111111101000111010100110111100001011110110001110101110001110110010111011110110001110100111001100110000001111000111111110100011101010011011110000101111011000111010111000001111111000111010111100111010101010011010001110101100011111000111111110100011101010011011110000101111011000111010111000 f1fe8ea6f0bd8eb8ecbbd8e9ccc0f1fe8ea6f0bd8eb83f8ebceaa68eb1f1fe8ea6f0bd8eb8
UTF-8 髷ヲ霆ク貉懃明髷ヲ霆ク﨟シ螟ア髷ヲ霆ク 111010011010101110110111111011111011110110100110111010011001110010000110111011111011110110111000111010001011001010001001111001101000011110000011111001101001100010001110111010011010101110110111111011111011110110100110111010011001110010000110111011111011110110111000111011111010100010011111111011111011110110111100111010001001111010011111111011111011110110110001111010011010101110110111111011111011110110100110111010011001110010000110111011111011110110111000 e9abb7efbda6e99c86efbdb8e8b289e68783e6988ee9abb7efbda6e99c86efbdb8efa89fefbdbce89e9fefbdb1e9abb7efbda6e99c86efbdb8
UHC ??霆??懃明??霆???螟???霆? 00111111001111111110111111111101001111110011111111010000110001001101100110100101001111110011111111101111111111010011111100111111001111111101100110101101001111110011111100111111111011111111110100111111 3f3feffd3f3fd0c4d9a53f3feffd3f3f3fd9ad3f3f3feffd3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)