To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??乳??儀?? 111000011001111100111111001111111001001111111011001111110011111110001011010101100011111100111111 e19f3f3f93fb3f3f8b563f3f
EUC-JP 癲??乳??儀?? 111000101010000100111111001111111100011011111101001111110011111110110101101101110011111100111111 e2a13f3fc6fd3f3fb5b73f3f
UTF-8 癲띿슜乳들렟儀먮룆 111001111001100110110010111010111001110110111111111011001000101010011100111001001011100110110011111010111001001110100100111010111010000010011111111001011000010010000000111010111010100010101110111010111010001110000110 e799b2eb9dbfec8a9ce4b9b3eb93a4eba09fe58480eba8aeeba386
UHC 癲띿슜乳들렟儀먮룆 111011111010011010001101111011001001101010101001111010101110000110110101111010011000111010110000111010111111000010010000111010111000111110000101 efa68dec9aa9eae1b5e98eb0ebf090eb8f85

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)