What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???宥∽?怨??要??誼ゅ?邑??雅 001111110011111100111111100101110100011110000001111001000011111110001001100001010011111100111111100101110111011000111111001111111000101101100010100000101110001100111111100101110101011100111111001111111000100111101011 3f3f3f974781e43f89853f3f97763f3f8b6282e33f97573f3f89eb
EUC-JP ???宥∽?怨??要??誼ゅ?邑??雅 001111110011111100111111110011011010100010100010111001100011111110110001111001010011111100111111110011011101011100111111001111111011010111000011101001001110010100111111110011011011100000111111001111111011001011101101 3f3f3fcda8a2e63fb1e53f3fcdd73f3fb5c3a4e53fcdb83f3fb2ed
UTF-8 麗멸퇌宥∽쭓怨뺤졁要쏄퉭誼ゅ▶邑뀁졋雅 111011111010011010001000111010111010100110111000111011011000011110001100111001011010111010100101111000101000100010111101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110000001111010001010011010000001111011001000111110000100111011011000100110101101111010001010101010111100111000111000001010000101111000101001011010110110111010011000001010010001111010111000000010000001111011001010000110001011111010011001101110000101 efa688eba9b8ed878ce5aea5e288bdecad93e680a8ebbaa4eca181e8a681ec8f84ed89ade8aabce38285e296b6e98291eb8081eca18be99b85
UHC 麗멸퇌宥∽쭓怨뺤졁要쏄퉭誼ゅ▶邑뀁졋雅 1110011010110000101110001110101010110111100111011110101011101001101000011110111110100111100010111110101010110011100101011110110010100000101100101110100110101001100110111110101010111001100001011110101111111110101010101110010110100010101110101110101111101001101100101110110010100000101110101110010010111010 e6b0b8eab79deae9a1efa78beab395eca0b2e9a99beab985ebfeaae5a2baebe9b2eca0bae4ba

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
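
The conversion shown in the table above can be approximated with a short Python sketch. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and encode_table, which are hypothetical and not part of the original tool. The sketch also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table rows but may not be exactly how the tool handles them.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the original tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Assumption: unencodable characters become "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あ").items():
        print(label, bits, hexstr)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.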