To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 節??蘂??節③?z節??蘂??節③?zB 1001000011011111001111110011111111100101010000010011111100111111100100001101111110000111010000100011111101111010100100001101111100111111001111111110010101000001001111110011111110010000110111111000011101000010001111110111101001000010 90df3f3fe5413f3f90df87423f7a90df3f3fe5413f3f90df87423f7a42
EUC-JP 節??蘂??節??z節??蘂??節??zB 110000001110000100111111001111111110100110100010001111110011111111000000111000010011111100111111011110101100000011100001001111110011111111101001101000100011111100111111110000001110000100111111001111110111101001000010 c0e13f3fe9a23f3fc0e13f3f7ac0e13f3fe9a23f3fc0e13f3f7a42
UTF-8 節억쉬蘂끾㉤節③쎁z節억쉬蘂끾㉤節③쎁zB 111001111010111110000000111011001001011010110101111011001000100110101100111010001001100010000010111010111000000110111110111000111000100110100100111001111010111110000000111000101001000110100010111011001000111010000001011110101110011110101111100000001110110010010110101101011110110010001001101011001110100010011000100000101110101110000001101111101110001110001001101001001110011110101111100000001110001010010001101000101110110010001110100000010111101001000010 e7af80ec96b5ec89ace89882eb81bee389a4e7af80e291a2ec8e817ae7af80ec96b5ec89ace89882eb81bee389a4e7af80e291a2ec8e817a42
UHC 節억쉬蘂끾㉤節③쎁z節억쉬蘂끾㉤節③쎁zB 111011111011110110111110111011111011110110101100111001111101111010000101111001101010100010110101111011111011110110101000111010011001101110101011011110101110111110111101101111101110111110111101101011001110011111011110100001011110011010101000101101011110111110111101101010001110100110011011101010110111101001000010 efbdbeefbdace7de85e6a8b5efbda8e99bab7aefbdbeefbdace7de85e6a8b5efbda8e99bab7a42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
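
A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name encode_table is hypothetical, and the Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumed to be close equivalents of ISO-8859-1, SJIS-Win, EUC-JP, UTF-8 and UHC, so individual mappings may differ from the converter's. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is the pattern visible in the ISO-8859-1 row.

```python
def encode_table(text, charsets=("latin-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return a list of (charset, binary string, hex string) for the given text.

    The codec names are assumptions standing in for the charsets in the table.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        data = text.encode(name, errors="replace")
        # Render each byte as 8 binary digits, concatenated without separators
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


# Example: print one row per charset for a short input string
for name, bits, hexa in encode_table("節zB"):
    print(name, bits, hexa)
```

Since "z" and "B" are plain ASCII, every charset listed here encodes them as 7a 42, which is why each row of the table ends in the same bits.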