What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??攸??馭??????援??揄?? 001111110011111100111111111000101000011000111111001111111001110110111111001111110011111111101001011001100011111100111111001111110011111100111111001111111000100110000111001111110011111110011101100010010011111100111111 3f3f3fe2863f3f9dbf3f3fe9663f3f3f3f3f3f89873f3f9d893f3f
EUC-JP ???竊??攸??馭???濚?Ŧ援??揄?? 00111111001111110011111111100011111001100011111100111111110110101100000100111111001111111111000111000111001111110011111100111111100011111100100110100001001111111000111110101001101011111011000111100111001111110011111111011001111010010011111100111111 3f3f3fe3e63f3fdac13f3ff1c73f3f3f8fc9a13f8fa9afb1e73f3fd9e93f3f
UTF-8 捻뀁뮆竊섋린攸귣뼀馭궽쎈쳸濚밸Ŧ援€뿥揄우땁 1110111110100110101001001110101110000000100000011110101110101110100001101110011110101011100010101110110010000100100010111110101110100110101100001110011010010100101110001110101010110111101000111110101110111100100000001110100110100110101011011110101010110110101111011110110010001110100010001110110010110011101110001110011010111111100110101110101110110000101110001100010110100110111001101000111110110100111000101000001010101100111010111011111110100101111001101000111110000100111011001001101010110000111010111001010110000001 efa6a4eb8081ebae86e7ab8aec848beba6b0e694b8eab7a3ebbc80e9a6adeab6bdec8e88ecb3b8e6bf9aebb0b8c5a6e68fb4e282acebbfa5e68f84ec9ab0eb9581
UHC 捻뀁뮆竊섋린攸귣뼀馭궽쎈쳸濚밸Ŧ援€뿥揄우땁 1110011011110111101100101110110010010010100101011110111110111100100110001110100010111000101100001110101011110010100000101110101110010110100010111110010111011111100000101100111010111101111010111010101110011011111001111011100110111001111010111010100010101110111010101011010110100010111001101001011110100101111010101111000110111111111011001011011010100010 e6f7b2ec9295efbc98e8b8b0eaf282eb968be5df82cebdebab9be7b9b9eba8aeeab5a2e697a5eaf1bfecb6a2

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
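
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary, hexadecimal.
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For the input `あ`, for example, UTF-8 yields the three bytes `e38182`, while ISO-8859-1 yields `3f` because the character has no representation in that charset.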