To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雍??肉f?醫??獄??援?????亦??? 1110100010110100001111110011111110010011111101111000001010000110001111111110011111001110001111110011111110001101100101100011111100111111100010011000011100111111001111110011111100111111001111111001011010010010001111110011111100111111 e8b43f3f93f782863fe7ce3f3f8d963f3f89873f3f3f3f3f96923f3f3f
EUC-JP 雍??肉f?醫??獄??援?????亦??? 1111000010110110001111110011111111000110111110011010001111100110001111111110111011010000001111110011111110111001111101100011111100111111101100011110011100111111001111110011111100111111001111111100101111110010001111110011111100111111 f0b63f3fc6f9a3e63feed03f3fb9f63f3fb1e73f3f3f3f3fcbf23f3f3f
UTF-8 雍우궠肉f뤃醫귣럞獄쏄퀣援쎾쪊硫몃굜亦낅틹劉 111010011001101110001101111011001001101010110000111010101011011010100000111010001000001010001001111011111011110110000110111010111010010010000011111010011000011010101011111010101011011110100011111010111001111110011110111001111000110110000100111011001000111110000100111011011000000010100011111001101000111110110100111011001000111010111110111011001010101010001010111011111010011110001110111010111010101010000011111010101011010110011100111001001011101010100110111010111000001010000101111011011000101110111001111011111010011110000111 e99b8dec9ab0eab6a0e88289efbd86eba483e986abeab7a3eb9f9ee78d84ec8f84ed80a3e68fb4ec8ebeecaa8aefa78eebaa83eab59ce4baa6eb8285ed8bb9efa787
UHC 雍우궠肉f뤃醫귣럞獄쏄퀣援쎾쪊硫몃굜亦낅틹劉 1110100010111100101111111110110010000010101100111110101110111111101000111110011010001111101101001110110010100010100000101110101110001110100000011110100010101011100110111110101010110011100101111110101010110101100110111110010110100101100001001110101110101001101110001110101110000010100001001110011010110010100001011110101110111010100111111110101011100101 e8bcbfec82b3ebbfa3e68fb4eca282eb8e81e8ab9beab397eab59be5a584eba9b8eb8284e6b285ebba9feae5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)