To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ?純ぁ□??ぴ?圍}?純ぁ□??ぴ?圍{^ 00111111100011111000001110000010100111111000000110100000001111110011111110000010110100100011111110011010101000010111110100111111100011111000001110000010100111111000000110100000001111110011111110000010110100100011111110011010101000010111101101011110 3f8f83829f81a03f3f82d23f9aa17d3f8f83829f81a03f3f82d23f9aa17b5e
EUC-JP ?純ぁ□??ぴ?圍}?純ぁ□??ぴ?圍{^ 00111111101111011110001110100100101000011010001010100010001111110011111110100100110101000011111111010100101000110111110100111111101111011110001110100100101000011010001010100010001111110011111110100100110101000011111111010100101000110111101101011110 3fbde3a4a1a2a23f3fa4d43fd4a37d3fbde3a4a1a2a23f3fa4d43fd4a37b5e
UTF-8 룵純ぁ□룴횕ぴ룫圍}룵純ぁ□룴횕ぴ룫圍{^ 111010111010001110110101111001111011010010010100111000111000000110000001111000101001011010100001111010111010001110110100111011011001101010010101111000111000000110110100111010111010001110101011111001011001110010001101011111011110101110100011101101011110011110110100100101001110001110000001100000011110001010010110101000011110101110100011101101001110110110011010100101011110001110000001101101001110101110100011101010111110010110011100100011010111101101011110 eba3b5e7b494e38181e296a1eba3b4ed9a95e381b4eba3abe59c8d7deba3b5e7b494e38181e296a1eba3b4ed9a95e381b4eba3abe59c8d7b5e
UHC 룵純ぁ□룴횕ぴ룫圍}룵純ぁ□룴횕ぴ룫圍{^ 100011111010101011100010111011011010101010100001101000011110000010001111101010011100001110001111101010101101010010001111101000101110101011001100011111011000111110101010111000101110110110101010101000011010000111100000100011111010100111000011100011111010101011010100100011111010001011101010110011000111101101011110 8faae2edaaa1a1e08fa9c38faad48fa2eacc7d8faae2edaaa1a1e08fa9c38faad48fa2eacc7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)