What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?丹?短?丹?鰻◇?丹?短?丹?鰻●^ 0011111110010010010011110011111110010010010110100011111110010010010011110011111110001001010101101000000110011110001111111001001001001111001111111001001001011010001111111001001001001111001111111000100101010110100000011001110001011110 3f924f3f925a3f924f3f8956819e3f924f3f925a3f924f3f8956819c5e
EUC-JP ?丹?短?丹?鰻◇?丹?短?丹?鰻●^ 0011111111000011101100000011111111000011101110110011111111000011101100000011111110110001101101111010000111111110001111111100001110110000001111111100001110111011001111111100001110110000001111111011000110110111101000011111110001011110 3fc3b03fc3bb3fc3b03fb1b7a1fe3fc3b03fc3bb3fc3b03fb1b7a1fc5e
UTF-8 렺丹렺短렺丹렺鰻◇렺丹렺短렺丹렺鰻●^ 11101011101000001011101011100100101110001011100111101011101000001011101011100111100111111010110111101011101000001011101011100100101110001011100111101011101000001011101011101001101100001011101111100010100101111000011111101011101000001011101011100100101110001011100111101011101000001011101011100111100111111010110111101011101000001011101011100100101110001011100111101011101000001011101011101001101100001011101111100010100101111000111101011110 eba0bae4b8b9eba0bae79fadeba0bae4b8b9eba0bae9b0bbe29787eba0bae4b8b9eba0bae79fadeba0bae4b8b9eba0bae9b0bbe2978f5e
UHC 렺丹렺短렺丹렺鰻◇렺丹렺短렺丹렺鰻●^ 10001110110000101101001110100001100011101100001011010011101011011000111011000010110100111010000110001110110000101101100011000100101000011101111010001110110000101101001110100001100011101100001011010011101011011000111011000010110100111010000110001110110000101101100011000100101000011101110001011110 8ec2d3a18ec2d3ad8ec2d3a18ec2d8c4a1de8ec2d3a18ec2d3ad8ec2d3a18ec2d8c4a1dc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
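
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` is hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        result[label] = (binary, data.hex())
    return result


if __name__ == "__main__":
    for label, (binary, hexa) in to_bit_strings("丹^").items():
        print(label, binary, hexa)
```

Each byte is formatted as eight binary digits, so the binary column is always eight times the byte length and four times as long as the hexadecimal column.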