To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???TWznf???TWzn^}Y???TWznf???TWzn^}bE 00111111001111110011111101010100010101110111101001101110011001100011111100111111001111110101010001010111011110100110111001011110011111010101100100111111001111110011111101010100010101110111101001101110011001100011111100111111001111110101010001010111011110100110111001011110011111010110001001000101 3f3f3f54577a6e663f3f3f54577a6e5e7d593f3f3f54577a6e663f3f3f54577a6e5e7d6245
SJIS-WIN 迪晧юTWznf迪晧юTWzn^}Y迪晧юTWznf迪晧юTWzn^}bE 11100111100011001001110111101100100001001001000001010100010101110111101001101110011001101110011110001100100111011110110010000100100100000101010001010111011110100110111001011110011111010101100111100111100011001001110111101100100001001001000001010100010101110111101001101110011001101110011110001100100111011110110010000100100100000101010001010111011110100110111001011110011111010110001001000101 e78c9dec849054577a6e66e78c9dec849054577a6e5e7d59e78c9dec849054577a6e66e78c9dec849054577a6e5e7d6245
EUC-JP 迪晧юTWznf迪晧юTWzn^}Y迪晧юTWznf迪晧юTWzn^}bE 11101101111011001101101011101110101001111111000001010100010101110111101001101110011001101110110111101100110110101110111010100111111100000101010001010111011110100110111001011110011111010101100111101101111011001101101011101110101001111111000001010100010101110111101001101110011001101110110111101100110110101110111010100111111100000101010001010111011110100110111001011110011111010110001001000101 edecdaeea7f054577a6e66edecdaeea7f054577a6e5e7d59edecdaeea7f054577a6e66edecdaeea7f054577a6e5e7d6245
UTF-8 迪晧юTWznf迪晧юTWzn^}Y迪晧юTWznf迪晧юTWzn^}bE 111010001011111110101010111001101001100110100111110100011000111001010100010101110111101001101110011001101110100010111111101010101110011010011001101001111101000110001110010101000101011101111010011011100101111001111101010110011110100010111111101010101110011010011001101001111101000110001110010101000101011101111010011011100110011011101000101111111010101011100110100110011010011111010001100011100101010001010111011110100110111001011110011111010110001001000101 e8bfaae699a7d18e54577a6e66e8bfaae699a7d18e54577a6e5e7d59e8bfaae699a7d18e54577a6e66e8bfaae699a7d18e54577a6e5e7d6245
UHC 迪晧юTWznf迪晧юTWzn^}Y迪晧юTWznf迪晧юTWzn^}bE 11101110111010001111101111000101101011001111000001010100010101110111101001101110011001101110111011101000111110111100010110101100111100000101010001010111011110100110111001011110011111010101100111101110111010001111101111000101101011001111000001010100010101110111101001101110011001101110111011101000111110111100010110101100111100000101010001010111011110100110111001011110011111010110001001000101 eee8fbc5acf054577a6e66eee8fbc5acf054577a6e5e7d59eee8fbc5acf054577a6e66eee8fbc5acf054577a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)