To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??竊??碎??齬??猷??濡リ?癰??異 11111010110100000011111100111111111000101000011000111111001111111110000111101010001111110011111111101010100101110011111100111111100101110101000100111111001111111001010001000111100000111000101000111111111000011001111000111111001111111000100011011001 fad03f3fe2863f3fe1ea3f3fea973f3f97513f3f9447838a3fe19e3f3f88d9
EUC-JP ???竊??碎??齬??猷??濡リ?癰??異 001111110011111100111111111000111110011000111111001111111110001011101100001111110011111111110011111101110011111100111111110011011011001000111111001111111100011110101000101001011110101000111111111000011111111000111111001111111011000011011011 3f3f3fe3e63f3fe2ec3f3ff3f73f3fcdb23f3fc7a8a5ea3fe1fe3f3fb0db
UTF-8 昻뉗떜竊섓쭕碎쇈럶齬잙뱪猷뗩뼸濡リ컼癰귥뇴異 111001101001100010111011111010111000100110010111111010111001011010011100111001111010101110001010111011001000010010010011111011001010110110010101111001111010001010001110111011001000011110001000111010111001111110110110111010011011110110101100111011001001111010011001111010111011000110101010111001111000110010110111111010111001011110101001111010111011110010111000111001101011111110100001111000111000001110101010111011001011101110111100111001111001100110110000111010101011011110100101111010111000011110110100111001111001010110110000 e698bbeb8997eb969ce7ab8aec8493ecad95e7a28eec8788eb9fb6e9bdacec9e99ebb1aae78cb7eb97a9ebbcb8e6bfa1e383aaecbbbce799b0eab7a5eb87b4e795b0
UHC 昻뉗떜竊섓쭕碎쇈럶齬잙뱪猷뗩뼸濡リ컼癰귥뇴異 1110010011101001100001111110110010001011101100101110111110111100100110001110111110100111100011011110000111101111101111001110001110001110100101011110010111100001100111111110101110010011100100001110101110100011100010111110100110010110101110111110101110100001101010111110101010110000100111011110100010111001100000101110110010000111100110001110110010110110 e4e987ec8bb2efbc98efa78de1efbce38e95e5e19feb9390eba38be996bbeba1abeab09de8b982ec8798ecb6

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
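
A conversion like the one in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page. The function name `to_bit_strings` is hypothetical, the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents of the charsets listed above, and it assumes that characters a charset cannot represent are replaced by "?" (0x3f), as the rows above suggest.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # Assumption: unencodable characters become "?" (byte 0x3f).
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Codec names are Python's; their mapping to the table's labels is assumed.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("リ", codec)
    print(label, binary, hexadecimal)
```

For the katakana character "リ", this yields `e383aa` in UTF-8, `838a` in CP932, and `a5ea` in EUC-JP, the same byte sequences that appear for that character in the rows above. ISO-8859-1 cannot represent it, so it falls back to `3f`.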