To which bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?厓ぢ??ケ??ギ}v?厓ぢ??ケ??ギ}vB 00111111111110101000110110000010110000000011111100111111100000110101000000111111001111111000001101001101011111010111011000111111111110101000110110000010110000000011111100111111100000110101000000111111001111111000001101001101011111010111011001000010 3ffa8d82c03f3f83503f3f834d7d763ffa8d82c03f3f83503f3f834d7d7642
EUC-JP ?厓ぢ??ケ??ギ}v?厓ぢ??ケ??ギ}vB 001111111000111110110100110001111010010011000010001111110011111110100101101100010011111100111111101001011010111001111101011101100011111110001111101101001100011110100100110000100011111100111111101001011011000100111111001111111010010110101110011111010111011001000010 3f8fb4c7a4c23f3fa5b13f3fa5ae7d763f8fb4c7a4c23f3fa5b13f3fa5ae7d7642
UTF-8 룶厓ぢ룵쾹ケ룵卽ギ}v룶厓ぢ룵쾹ケ룵卽ギ}vB 1110101110100011101101101110010110001110100100111110001110000001101000101110101110100011101101011110110010111110101110011110001110000010101100011110101110100011101101011110010110001101101111011110001110000010101011100111110101110110111010111010001110110110111001011000111010010011111000111000000110100010111010111010001110110101111011001011111010111001111000111000001010110001111010111010001110110101111001011000110110111101111000111000001010101110011111010111011001000010 eba3b6e58e93e381a2eba3b5ecbeb9e382b1eba3b5e58dbde382ae7d76eba3b6e58e93e381a2eba3b5ecbeb9e382b1eba3b5e58dbde382ae7d7642
UHC 룶厓ぢ룵쾹ケ룵卽ギ}v룶厓ぢ룵쾹ケ룵卽ギ}vB 1000111110101011111001001110110110101010110000101000111110101010101100101000111110101011101100011000111110101010111100011110110110101011101011100111110101110110100011111010101111100100111011011010101011000010100011111010101010110010100011111010101110110001100011111010101011110001111011011010101110101110011111010111011001000010 8fabe4edaac28faab28fabb18faaf1edabae7d768fabe4edaac28faab28fabb18faaf1edabae7d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
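
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper names encode_row and conversion_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with one codec and return (bytes, binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return raw, bits, raw.hex()


def conversion_table(text):
    """Return one (label, binary string, hex string) row per charset."""
    rows = []
    for label, codec in CODECS.items():
        _, bits, hex_string = encode_row(text, codec)
        rows.append((label, bits, hex_string))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in conversion_table("}vB"):
        print(label, bits, hex_string)
```

For the ASCII input "}vB", every charset listed here yields the same bytes, 7d7642, which matches the final three bytes of each row in the table.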