To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???乙??亦?????怨??筌??誼?? 0011111100111111001111111000100110110011001111110011111110010110100100100011111100111111001111110011111100111111100010011000010100111111001111111110001010100011001111110011111110001011011000100011111100111111 3f3f3f89b33f3f96923f3f3f3f3f89853f3fe2a33f3f8b623f3f
EUC-JP ???乙??亦??沅??怨??筌??誼?? 00111111001111110011111110110010101101010011111100111111110010111111001000111111001111111000111111000110111010010011111100111111101100011110010100111111001111111110010010100101001111110011111110110101110000110011111100111111 3f3f3fb2b53f3fcbf23f3f8fc6e93f3fb1e53f3fe4a53f3fb5c33f3f
UTF-8 捻뀁뫑乙대뎐亦껋눘沅싪슀怨⑸샷筌믨퀣誼섊뙠 111011111010011010100100111010111000000010000001111010111010101110010001111001001011100110011001111010111000110010000000111010111000111010010000111001001011101010100110111010101011101110001011111010111000100010011000111001101011001010000101111011001000101110101010111011001000101010000000111001101000000010101000111000101001000110111000111011001000001110110111111001111010110110001100111010111010111110101000111011011000000010100011111010001010101010111100111011001000010010001010111010111001100110100000 efa6a4eb8081ebab91e4b999eb8c80eb8e90e4baa6eabb8beb8898e6b285ec8baaec8a80e680a8e291b8ec83b7e7ad8cebafa8ed80a3e8aabcec848aeb99a0
UHC 捻뀁뫑乙대뎐亦껋눘沅싪슀怨⑸샷筌믨퀣誼섊뙠 111001101111011110110010111011001001000110110011111010111110000010110100111010111011010110101111111001101011001010000011111011001000011110110001111010101011011010011010111010001001101010010011111010101011001110101001111010111011110010100110111011111010011110010010111010101011001110010111111010111111111010011000111001111000110010100101 e6f7b2ec91b3ebe0b4ebb5afe6b283ec87b1eab69ae89a93eab3a9ebbca6efa792eab397ebfe98e78ca5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)