To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雍??肉f?懿??孃る?苑?????椰 111010001011010000111111001111111001001111110111100000101000011000111111100111001111001000111111001111111001101101101111100000101110100100111111100010011001000100111111001111110011111100111111001111111001111010111101 e8b43f3f93f782863f9cf23f3f9b6f82e93f89913f3f3f3f3f9ebd
EUC-JP 雍??肉f?懿??孃る?苑?????椰 111100001011011000111111001111111100011011111001101000111110011000111111110110001111010000111111001111111101010111010000101001001110101100111111101100011111000100111111001111110011111100111111001111111101110010111111 f0b63f3fc6f9a3e63fd8f43f3fd5d0a4eb3fb1f13f3f3f3f3fdcbf
UTF-8 雍우궠肉f뤃懿꿸턀孃る굞苑묈쪊硫명뇠椰 111010011001101110001101111011001001101010110000111010101011011010100000111010001000001010001001111011111011110110000110111010111010010010000011111001101000011110111111111010101011111110111000111011011000010010000000111001011010110110000011111000111000001010001011111010101011010110011110111010001000101110010001111010111010110010001000111011001010101010001010111011111010011110001110111010111010101010000101111010111000011110100000111001101010010010110000 e99b8dec9ab0eab6a0e88289efbd86eba483e687bfeabfb8ed8480e5ad83e3828beab59ee88b91ebac88ecaa8aefa78eebaa85eb87a0e6a4b0
UHC 雍우궠肉f뤃懿꿸턀孃る굞苑묈쪊硫명뇠椰 1110100010111100101111111110110010000010101100111110101110111111101000111110011010001111101101001110101111110011101100101110101010110101100111001110010110111110101010101110101110000010100001101110101010111101100100011110010110100101100001001110101110101001101110001110110110000111100010001110010110101011 e8bcbfec82b3ebbfa3e68fb4ebf3b2eab59ce5beaaeb8286eabd91e5a584eba9b8ed8788e5ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)