What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??魏??怨ル????竊??儀??億??有 111010011111001000111111001111111110100110110000001111110011111110001001100001011000001110001011001111110011111100111111001111111110001010000110001111110011111110001011010101100011111100111111100010011010110100111111001111111001011101001100 e9f23f3fe9b03f3f8985838b3f3f3f3fe2863f3f8b563f3f89ad3f3f974c
EUC-JP 鶯??魏??怨ル????竊??儀??億??有 111100101111010000111111001111111111001010110010001111110011111110110001111001011010010111101011001111110011111100111111001111111110001111100110001111110011111110110101101101110011111100111111101100101010111100111111001111111100110110101101 f2f43f3ff2b23f3fb1e5a5eb3f3f3f3fe3e63f3fb5b73f3fb2af3f3fcdad
UTF-8 鶯볦눖魏꾣끽怨ル겱亮쎄램竊먲쬉儀뺢턂億됰슗有 111010011011011010101111111010111011001110100110111010111000100010010110111010011010110110001111111010101011111010100011111010111000000110111101111001101000000010101000111000111000001110101011111010101011001010110001111011111010010110110111111011001000111010000100111010111001111010101000111001111010101110001010111010111010100010110010111011001010110010001001111001011000010010000000111010111011101010100010111011011000010010000010111001011000010010000100111010111001000010110000111011001000101010010111111001101001110010001001 e9b6afebb3a6eb8896e9ad8feabea3eb81bde680a8e383abeab2b1efa5b7ec8e84eb9ea8e7ab8aeba8b2ecac89e58480ebbaa2ed8482e58484eb90b0ec8a97e69c89
UHC 鶯볦눖魏꾣끽怨ル겱亮쎄램竊먲쬉儀뺢턂億됰슗有 1110010110100011100100111110110010000111101100001110101011100000100001001110011010110011101000111110101010110011101010111110101110000001101111011110010110111001101111011110101010110111101001011110111110111100100100001110111110100110100111111110101111110000100101011110101010110101100111101110010111100010100010011110101110011010101001101110101011110011 e5a393ec87b0eae084e6b3a3eab3abeb81bde5b9bdeab7a5efbc90efa69febf095eab59ee5e289eb9aa6eaf3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
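
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's standard codecs `cp932` and `cp949` are acceptable stand-ins for SJIS-Win and UHC, and that unencodable characters are replaced with `?` (byte 3f), as the table appears to do. The function name `encode_report` is hypothetical.

```python
# Sketch only: codec names are assumptions mapped to the table's charsets
# (cp932 for SJIS-Win, cp949 for UHC); encode_report is a hypothetical helper.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexed in encode_report("有"):
        print(name, bits, hexed)
```

For the single character 有, this yields e69c89 for UTF-8, 974c for cp932, and cdad for EUC-JP, which matches the trailing bytes of the corresponding rows in the table. ISO-8859-1 cannot represent the character, so it yields 3f.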