What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???油??儒??v???油??儒??vB 00111111001111110011111110010110111110110011111100111111100011101111001000111111001111110111011000111111001111110011111110010110111110110011111100111111100011101111001000111111001111110111011001000010 3f3f3f96fb3f3f8ef23f3f763f3f3f96fb3f3f8ef23f3f7642
EUC-JP ???油??儒??v???油??儒??vB 00111111001111110011111111001100111111010011111100111111101111001111010000111111001111110111011000111111001111110011111111001100111111010011111100111111101111001111010000111111001111110111011001000010 3f3f3fccfd3f3fbcf43f3f763f3f3fccfd3f3fbcf43f3f7642
UTF-8 力녹떒油녽슆儒띠쉠v力녹떒油녽슆儒띠쉠vB 111011111010011010001010111010111000010110111001111010111001011010010010111001101011001010111001111010111000010110111101111011001000101010000110111001011000010010010010111010111001110110100000111011001000100110100000011101101110111110100110100010101110101110000101101110011110101110010110100100101110011010110010101110011110101110000101101111011110110010001010100001101110010110000100100100101110101110011101101000001110110010001001101000000111011001000010 efa68aeb85b9eb9692e6b2b9eb85bdec8a86e58492eb9da0ec89a076efa68aeb85b9eb9692e6b2b9eb85bdec8a86e58492eb9da0ec89a07642
UHC 力녹떒油녽슆儒띠쉠v力녹떒油녽슆儒띠쉠vB 111001101011001110110011111011001000101110101000111010101111101010000110111010011001101010011000111010101110001110110110111011001011110110101010011101101110011010110011101100111110110010001011101010001110101011111010100001101110100110011010100110001110101011100011101101101110110010111101101010100111011001000010 e6b3b3ec8ba8eafa86e99a98eae3b6ecbdaa76e6b3b3ec8ba8eafa86e99a98eae3b6ecbdaa7642

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
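
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The function name encode_to_bits and the CHARSETS mapping are hypothetical, and the Python codec names (cp932 for SJIS-win, cp949 for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1 row.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# cp932 and cp949 are assumed stand-ins for SJIS-win and UHC respectively.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with the given codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "油vB"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output line follows the same column order as the table: charset, character, binary bit string, hexadecimal bit string.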