What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 節??墺↑????節ヨ?與??辱??榮 100100001101111100111111001111111001101011010010100000011010101000111111001111110011111100111111100100001101111110000011100010000011111111100100011011110011111100111111100100000100101000111111001111111001111011000100 90df3f3f9ad281aa3f3f3f3f90df83883fe46f3f3f904a3f3f9ec4
EUC-JP 節??墺↑????節ヨ?與??辱??榮 110000001110000100111111001111111101010011010100101000101010110000111111001111110011111100111111110000001110000110100101111010000011111111100111110100000011111100111111101111111010101100111111001111111101110011000110 c0e13f3fd4d4a2ac3f3f3f3fc0e1a5e83fe7d03f3fbfab3f3fdcc6
UTF-8 節억쉬墺↑뮅廉뷂슭節ヨ덱與듸쉔辱녻풛榮 111001111010111110000000111011001001011010110101111011001000100110101100111001011010001010111010111000101000011010010001111010111010111010000101111011111010011010100010111010111011011110000010111011001000101010101101111001111010111110000000111000111000001110101000111010111000110110110001111010001000100010000111111010111001001110111000111011001000100110010100111010001011111010110001111010111000010110111011111011011001001010011011111001101010011010101110 e7af80ec96b5ec89ace5a2bae28691ebae85efa6a2ebb782ec8aade7af80e383a8eb8db1e88887eb93b8ec8994e8beb1eb85bbed929be6a6ae
UHC 節억쉬墺↑뮅廉뷂슭節ヨ덱與듸쉔辱녻풛榮 1110111110111101101111101110111110111101101011001110011111110010101000011110100010010010100101001110011011110101100101001110111110111101101111101110111110111101101010111110100010110101101001101110011010101000101101011110111110111101101010001110100110110100100001101110100010111110100111101110011110110100 efbdbeefbdace7f2a1e89294e6f594efbdbeefbdabe8b5a6e6a8b5efbda8e9b486e8be9ee7b4

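SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table above.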
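
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption that may differ from this page's converter for some characters. Unencodable characters are replaced with "?", matching the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are close equivalents, not guaranteed to match this page byte for byte.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("節").items():
        print(label, bits, hexstr)
```

For the single character 節, this sketch yields e7af80 for UTF-8, 90df for SJIS-WIN, and c0e1 for EUC-JP, which agrees with the first character of the corresponding rows in the table.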