To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245
SJIS-WIN 腫???nf腫???n^}Y腫???nf腫???n^}bE 100011101110111000111111001111110011111101101110011001101000111011101110001111110011111100111111011011100101111001111101010110011000111011101110001111110011111100111111011011100110011010001110111011100011111100111111001111110110111001011110011111010110001001000101 8eee3f3f3f6e668eee3f3f3f6e5e7d598eee3f3f3f6e668eee3f3f3f6e5e7d6245
EUC-JP 腫???nf腫???n^}Y腫???nf腫???n^}bE 101111001111000000111111001111110011111101101110011001101011110011110000001111110011111100111111011011100101111001111101010110011011110011110000001111110011111100111111011011100110011010111100111100000011111100111111001111110110111001011110011111010110001001000101 bcf03f3f3f6e66bcf03f3f3f6e5e7d59bcf03f3f3f6e66bcf03f3f3f6e5e7d6245
UTF-8 腫띠렮렧nf腫띠렮렧n^}Y腫띠렮렧nf腫띠렮렧n^}bE 11101000100001011010101111101011100111011010000011101011101000001010111011101011101000001010011101101110011001101110100010000101101010111110101110011101101000001110101110100000101011101110101110100000101001110110111001011110011111010101100111101000100001011010101111101011100111011010000011101011101000001010111011101011101000001010011101101110011001101110100010000101101010111110101110011101101000001110101110100000101011101110101110100000101001110110111001011110011111010110001001000101 e885abeb9da0eba0aeeba0a76e66e885abeb9da0eba0aeeba0a76e5e7d59e885abeb9da0eba0aeeba0a76e66e885abeb9da0eba0aeeba0a76e5e7d6245
UHC 腫띠렮렧nf腫띠렮렧n^}Y腫띠렮렧nf腫띠렮렧n^}bE 111100001111111010110110111011001000111010111011100011101011011001101110011001101111000011111110101101101110110010001110101110111000111010110110011011100101111001111101010110011111000011111110101101101110110010001110101110111000111010110110011011100110011011110000111111101011011011101100100011101011101110001110101101100110111001011110011111010110001001000101 f0feb6ec8ebb8eb66e66f0feb6ec8ebb8eb66e5e7d59f0feb6ec8ebb8eb66e66f0feb6ec8ebb8eb66e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)