To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺??肄ょ?誘γ????肉??循??嚴 111110111100010000111111001111111110001111100101100000101110010100111111100101110101010110000011110000010011111100111111001111110011111110010011111101110011111100111111100011110111101000111111001111111001101010001110 fbc43f3fe3e582e53f975583c13f3f3f3f93f73f3f8f7a3f3f9a8e
EUC-JP 鈺??肄ょ?誘γ????肉??循??嚴 10001111111000111101010100111111001111111110011011100111101001001110011100111111110011011011011010100110110000110011111100111111001111110011111111000110111110010011111100111111101111011101101100111111001111111101001111101110 8fe3d53f3fe6e7a4e73fcdb6a6c33f3f3f3fc6f93f3fbddb3f3fd3ee
UTF-8 鈺쎄막肄ょ㎣誘γ궕僚노퀬肉덁튋循낆떼嚴 1110100110001000101110101110110010001110100001001110101110100111100010011110100010000010100001001110001110000010100001111110001110001110101000111110100010101010100110001100111010110011111010101011011010010101111011111010011010111011111010111000010110111000111011011000000010101100111010001000001010001001111010111000110110000001111011011000101010001011111001011011111010101010111010111000001010000110111010111001011010111100111001011001101010110100 e988baec8e84eba789e88284e38287e38ea3e8aa98ceb3eab695efa6bbeb85b8ed80ace88289eb8d81ed8a8be5beaaeb8286eb96bce59ab4
UHC 鈺쎄막肄ょ㎣誘γ궕僚노퀬肉덁튋循낆떼嚴 1110100010101101101111011110101010111000101101111110110010111101101010101110011110100111101001111110101110101111101001011110001110000010101010101110100011101000101100111110101110110011101000001110101110111111100010001110010010111001100111111110001011100000100001011110110010110110101111001110010111110001 e8adbdeab8b7ecbdaae7a7a7ebafa5e382aae8e8b3ebb3a0ebbf88e4b99fe2e085ecb6bce5f1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)