To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??肄??宥??艶k?遊??恂ル?? 111010100100000000111111001111111110001111100101001111110011111110010111010001110011111100111111100010011001000010000010100010110011111110010111010101100011111100111111100111001001011010000011100010110011111100111111 ea403f3fe3e53f3f97473f3f8990828b3f97563f3f9c96838b3f3f
EUC-JP 鵝??肄??宥??艶k?遊??恂ル?? 111100111010000100111111001111111110011011100111001111110011111111001101101010000011111100111111101100011111000010100011111010110011111111001101101101110011111100111111110101111111011010100101111010110011111100111111 f3a13f3fe6e73f3fcda83f3fb1f0a3eb3fcdb73f3fd7f6a5eb3f3f
UTF-8 鵝숈뮆肄덃끽宥뱀궃艶k벚遊꾤솾恂ル쐜力 111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000011111010111000000110111101111001011010111010100101111010111011000110000000111010101011011010000011111010001000100110110110111011111011110110001011111010111011001010011010111010011000000110001010111010101011111010100100111011001000011010111110111001101000000110000010111000111000001110101011111011001001000010011100111011111010011010001010 e9b59dec8888ebae86e88284eb8d83eb81bde5aea5ebb180eab683e889b6efbd8bebb29ae9818aeabea4ec86bee68182e383abec909cefa68a
UHC 鵝숈뮆肄덃끽宥뱀궃艶k벚遊꾤솾恂ル쐜力 1110010010111101100110011110110010010010100101011110110010111101100010001110011010110011101000111110101011101001101110011110110010000010100111001110011011111101101000111110101110111010101000101110101110110100100001001110011110011001101100101110001011100001101010111110101110011100100000101110011010110011 e4bd99ec9295ecbd88e6b3a3eae9b9ec829ce6fda3ebbaa2ebb484e799b2e2e1abeb9c82e6b3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)