To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??矣??鵝??肄??幼??孃る?悠 111000011001111100111111001111111110000111100001001111110011111111101010010000000011111100111111111000111110010100111111001111111001011101100011001111110011111110011011011011111000001011101001001111111001011101001001 e19f3f3fe1e13f3fea403f3fe3e53f3f97633f3f9b6f82e93f9749
EUC-JP 癲??矣??鵝??肄??幼??孃る?悠 111000101010000100111111001111111110001011100011001111110011111111110011101000010011111100111111111001101110011100111111001111111100110111000100001111110011111111010101110100001010010011101011001111111100110110101010 e2a13f3fe2e33f3ff3a13f3fe6e73f3fcdc43f3fd5d0a4eb3fcdaa
UTF-8 癲졻뵩矣곕룺鵝숈뮆肄덃끽幼먯춪孃る쪇悠 111001111001100110110010111011001010000110111011111010111011010110101001111001111001111110100011111010101011001110010101111010111010001110111010111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000011111010111000000110111101111001011011100110111100111010111010100010101111111011001011011010101010111001011010110110000011111000111000001010001011111011001010101010000111111001101000001010100000 e799b2eca1bbebb5a9e79fa3eab395eba3bae9b59dec8888ebae86e88284eb8d83eb81bde5b9bceba8afecb6aae5ad83e3828becaa87e682a0
UHC 癲졻뵩矣곕룺鵝숈뮆肄덃끽幼먯춪孃る쪇悠 1110111110100110101000001110001010010100101001111110101111111000101100001110101110001111101011011110010010111101100110011110110010010010100101011110110010111101100010001110011010110011101000111110101011101010100100001110110010101101100001111110010110111110101010101110101110100101100000011110101011101101 efa6a0e294a7ebf8b0eb8fade4bd99ec9295ecbd88e6b3a3eaea90ecad87e5beaaeba581eaed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)