To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????f????^}Y????f????^}bE 00111111001111110011111100111111011001100011111100111111001111110011111101011110011111010101100100111111001111110011111100111111011001100011111100111111001111110011111101011110011111010110001001000101 3f3f3f3f663f3f3f3f5e7d593f3f3f3f663f3f3f3f5e7d6245
SJIS-WIN 閾搾、始f閾搾、始^}Y閾搾、始f閾搾、始^}bE 11101000100001111000110111101111101001001000111001101110011001101110100010000111100011011110111110100100100011100110111001011110011111010101100111101000100001111000110111101111101001001000111001101110011001101110100010000111100011011110111110100100100011100110111001011110011111010110001001000101 e8878defa48e6e66e8878defa48e6e5e7d59e8878defa48e6e66e8878defa48e6e5e7d6245
EUC-JP 閾搾、始f閾搾、始^}Y閾搾、始f閾搾、始^}bE 1110111111100111101110101111000110001110101001001011101111001111011001101110111111100111101110101111000110001110101001001011101111001111010111100111110101011001111011111110011110111010111100011000111010100100101110111100111101100110111011111110011110111010111100011000111010100100101110111100111101011110011111010110001001000101 efe7baf18ea4bbcf66efe7baf18ea4bbcf5e7d59efe7baf18ea4bbcf66efe7baf18ea4bbcf5e7d6245
UTF-8 閾搾、始f閾搾、始^}Y閾搾、始f閾搾、始^}bE 111010011001011010111110111001101001000010111110111011111011110110100100111001011010011110001011011001101110100110010110101111101110011010010000101111101110111110111101101001001110010110100111100010110101111001111101010110011110100110010110101111101110011010010000101111101110111110111101101001001110010110100111100010110110011011101001100101101011111011100110100100001011111011101111101111011010010011100101101001111000101101011110011111010110001001000101 e996bee690beefbda4e5a78b66e996bee690beefbda4e5a78b5e7d59e996bee690beefbda4e5a78b66e996bee690beefbda4e5a78b5e7d6245
UHC ?搾?始f?搾?始^}Y?搾?始f?搾?始^}bE 001111111111001110110110001111111110001110110111011001100011111111110011101101100011111111100011101101110101111001111101010110010011111111110011101101100011111111100011101101110110011000111111111100111011011000111111111000111011011101011110011111010110001001000101 3ff3b63fe3b7663ff3b63fe3b75e7d593ff3b63fe3b7663ff3b63fe3b75e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)