To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 諸??云?韻逢??}諸??云?韻逢??{^ 1000111110010100001111110011111110001001010111010011111110001001010000111000100010100111001111110011111101111101100011111001010000111111001111111000100101011101001111111000100101000011100010001010011100111111001111110111101101011110 8f943f3f895d3f894388a73f3f7d8f943f3f895d3f894388a73f3f7b5e
EUC-JP 諸??云?韻逢??}諸??云?韻逢??{^ 1011110111110100001111110011111110110001101111100011111110110001101001001011000010101001001111110011111101111101101111011111010000111111001111111011000110111110001111111011000110100100101100001010100100111111001111110111101101011110 bdf43f3fb1be3fb1a4b0a93f3f7dbdf43f3fb1be3fb1a4b0a93f3f7b5e
UTF-8 諸쇠浪云렗韻逢렰렫}諸쇠浪云렗韻逢렰렫{^ 111010001010101110111000111011001000011110100000111011111010010010101010111001001011101010010001111010111010000010010111111010011001111110111011111010011000000010100010111010111010000010110000111010111010000010101011011111011110100010101011101110001110110010000111101000001110111110100100101010101110010010111010100100011110101110100000100101111110100110011111101110111110100110000000101000101110101110100000101100001110101110100000101010110111101101011110 e8abb8ec87a0efa4aae4ba91eba097e99fbbe980a2eba0b0eba0ab7de8abb8ec87a0efa4aae4ba91eba097e99fbbe980a2eba0b0eba0ab7b5e
UHC 諸쇠浪云렗韻逢렰렫}諸쇠浪云렗韻逢렰렫{^ 111100001011001110111100111010001101001010101001111010011111011010001110101011001110101010100100110111001111000110001110101111011000111010111001011111011111000010110011101111001110100011010010101010011110100111110110100011101010110011101010101001001101110011110001100011101011110110001110101110010111101101011110 f0b3bce8d2a9e9f68eaceaa4dcf18ebd8eb97df0b3bce8d2a9e9f68eaceaa4dcf18ebd8eb97b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)