To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蒸?迂狡¬碇?乙?蒸?迂狡¬碇?媛?^ 10001111111101100011111110001001010010011110000011000010100000011100101010010010111101000011111110001001101100110011111110001111111101100011111110001001010010011110000011000010100000011100101010010010111101000011111110010101010100010011111101011110 8ff63f8949e0c281ca92f43f89b33f8ff63f8949e0c281ca92f43f95513f5e
EUC-JP 蒸?迂狡¬碇?乙?蒸?迂狡¬碇?媛?^ 10111110111110000011111110110001101010101110000011000100101000101100110011000100111101100011111110110010101101010011111110111110111110000011111110110001101010101110000011000100101000101100110011000100111101100011111111001001101100100011111101011110 bef83fb1aae0c4a2ccc4f63fb2b53fbef83fb1aae0c4a2ccc4f63fc9b23f5e
UTF-8 蒸렣迂狡¬碇렢乙렏蒸렣迂狡¬碇렢媛렜^ 11101000100100101011100011101011101000001010001111101000101111111000001011100111100010111010000111101111101111111010001011100111101000101000011111101011101000001010001011100100101110011001100111101011101000001000111111101000100100101011100011101011101000001010001111101000101111111000001011100111100010111010000111101111101111111010001011100111101000101000011111101011101000001010001011100101101010101001101111101011101000001001110001011110 e892b8eba0a3e8bf82e78ba1efbfa2e7a287eba0a2e4b999eba08fe892b8eba0a3e8bf82e78ba1efbfa2e7a287eba0a2e5aa9beba09c5e
UHC 蒸렣迂狡¬碇렢乙렏蒸렣迂狡¬碇렢媛렜^ 11110001111110101000111010110100111010011110011011001110111010101010000111111110111011111110110110001110101100111110101111100000100011101010010111110001111110101000111010110100111010011110011011001110111010101010000111111110111011111110110110001110101100111110101010110000100011101010111001011110 f1fa8eb4e9e6ceeaa1feefed8eb3ebe08ea5f1fa8eb4e9e6ceeaa1feefed8eb3eab08eae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)