To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 搖?????宋???▲????兢???筍?? 100111011000101000111111001111110011111100111111001111111001000101110110001111110011111100111111100000011010001100111111001111110011111100111111100110010101110100111111001111110011111111100010101000010011111100111111 9d8a3f3f3f3f3f91763f3f3f81a33f3f3f3f995d3f3f3fe2a13f3f
EUC-JP 搖??靷??宋??璵▲????兢???筍?? 11011001111010100011111100111111100011111110011110111101001111110011111111000001110101110011111100111111100011111100110011100110101000101010010100111111001111110011111100111111110100011011111000111111001111110011111111100100101000110011111100111111 d9ea3f3f8fe7bd3f3fc1d73f3f8fcce6a2a53f3f3f3fd1be3f3f3fe4a33f3f
UTF-8 搖깅ㅏ靷숁만宋볦뵖璵▲룂留뗥슫兢理롩솻筍⑹챽 111001101001000010010110111010101011100110000101111000111000010110001111111010011001110110110111111011001000100010000001111010111010011110001100111001011010111010001011111010111011001110100110111010111011010110010110111001111001001010110101111000101001011010110010111010111010001110000010111011111010011110001101111010111001011110100101111011001000101010101011111001011000010110100010111011111010011110100100111010111010000110101001111011001000011010111011111001111010110110001101111000101001000110111001111011001011000110111101 e69096eab985e3858fe99db7ec8881eba78ce5ae8bebb3a6ebb596e792b5e296b2eba382efa78deb97a5ec8aabe585a2efa7a4eba1a9ec86bbe7ad8de291b9ecb1bd
UHC 搖깅ㅏ靷숁만宋볦뵖璵▲룂留뗥슫兢理롩솻筍⑹챽 1110100011110100101100011110101110100100101111111110110011100110100110011110011010111000101110001110000111100100100100111110110010010100100110001110011010100101101000011110001110001111100000111110101110100111100010111110010110011010101101001101000011100111111011001011010110001110111010011001100110110000111000101110110010101001111011001010101010001010 e8f4b1eba4bfece699e6b8b8e1e493ec9498e6a5a1e38f83eba78be59ab4d0e7ecb58ee999b0e2eca9ecaa8a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)