To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 搖??泣??扱肉??純??鳶??違▽? 100111011000101000111111001111111000101110000011001111110011111110001000101101011001001111110111001111110011111110001111100000110011111100111111100100111100111000111111001111111000100011100001100000011010010000111111 9d8a3f3f8b833f3f88b593f73f3f8f833f3f93ce3f3f88e181a43f
EUC-JP 搖??泣??扱肉??純??鳶??違▽? 110110011110101000111111001111111011010111100011001111110011111110110000101101111100011011111001001111110011111110111101111000110011111100111111110001101101000000111111001111111011000011100011101000101010011000111111 d9ea3f3fb5e33f3fb0b7c6f93f3fbde33f3fc6d03f3fb0e3a2a63f
UTF-8 搖깅쪇泣길끽扱肉욤첋純껓폏鳶롫끏違▽풚 111001101001000010010110111010101011100110000101111011001010101010000111111001101011001110100011111010101011100010111000111010111000000110111101111001101000100110110001111010001000001010001001111011001001101010100100111011001011001010001011111001111011010010010100111010101011101110010011111011011000111110001111111010011011001110110110111010111010000110101011111010111000000110001111111010011000000110010101111000101001011010111101111011011001001010011010 e69096eab985ecaa87e6b3a3eab8b8eb81bde689b1e88289ec9aa4ecb28be7b494eabb93ed8f8fe9b3b6eba1abeb818fe98195e296bded929a
UHC 搖깅쪇泣길끽扱肉욤첋純껓폏鳶롫끏違▽풚 1110100011110100101100011110101110100101100000011110101111101000101100011110011010110011101000111101000011100010111010111011111110111111111010001010101010011000111000101110110110000011111011111011110010011010111001101110100110001110111010111000010110111111111010101101111010100001111001001011111010011101 e8f4b1eba581ebe8b1e6b3a3d0e2ebbfbfe8aa98e2ed83efbc9ae6e98eeb85bfeadea1e4be9d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)