Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??鎰??臾??嚥〓?釉o?臾??筌??苑?? 11100010101000110011111100111111111010000100110000111111001111111110010001101011001111110011111110011010100010111000000110101100001111111110011111010110100000101000111100111111111001000110101100111111001111111110001010100011001111110011111110001001100100010011111100111111 e2a33f3fe84c3f3fe46b3f3f9a8b81ac3fe7d6828f3fe46b3f3fe2a33f3f89913f3f
EUC-JP 筌??鎰??臾??嚥〓?釉o?臾??筌??苑?? 11100100101001010011111100111111111011111010110100111111001111111110011111001100001111110011111111010011111010111010001010101110001111111110111011011000101000111110111100111111111001111100110000111111001111111110010010100101001111110011111110110001111100010011111100111111 e4a53f3fefad3f3fe7cc3f3fd3eba2ae3feed8a3ef3fe7cc3f3fe4a53f3fb1f13f3f
UTF-8 筌뗪릿鎰싷쭓臾먯졐嚥〓슣釉o쭓臾먯졐筌뗭쥙苑길략 111001111010110110001100111010111001011110101010111010111010011010111111111010011000111010110000111011001000101110110111111011001010110110010011111010001000011110111110111010111010100010101111111011001010000110010000111001011001101010100101111000111000000010010011111011001000101010100011111010011000011110001001111011111011110110001111111011001010110110010011111010001000011110111110111010111010100010101111111011001010000110010000111001111010110110001100111010111001011110101101111011001010010110011001111010001000101110010001111010101011100010111000111010111001111010110101 e7ad8ceb97aaeba6bfe98eb0ec8bb7ecad93e887beeba8afeca190e59aa5e38093ec8aa3e98789efbd8fecad93e887beeba8afeca190e7ad8ceb97adeca599e88b91eab8b8eb9eb5
UHC 筌뗪릿鎰싷쭓臾먯졐嚥〓슣釉o쭓臾먯졐筌뗭쥙苑길략 111011111010011110001011111010101011100010110100111011001111000010011010111011111010011110001011111010111010110010010000111011001010000010111101111001101011111110100001111010111001101010101111111010111011100010100011111011111010011110001011111010111010110010010000111011001010000010111101111011111010011110001011111011001010001010001110111010101011110110110001111001101011011110101011 efa78beab8b4ecf09aefa78bebac90eca0bde6bfa1eb9aafebb8a3efa78bebac90eca0bdefa78beca28eeabdb1e6b7ab
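
The rows above can be approximated offline with a minimal Python sketch. It is not the converter's own implementation: the codec names are assumptions that map the table's labels onto Python's standard codecs (SJIS-WIN to cp932, UHC to cp949), and the helper names `encode_row` and `encoding_table` are hypothetical. The 3f bytes in the table suggest that unencodable characters are replaced with "?", so the sketch assumes the same behavior through `errors="replace"`.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the codec cannot represent become "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encoding_table(text):
    """Build one (bits, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encoding_table("あ").items():
        print(label, bits, hex_string)
```

For an ASCII input such as "A", every charset in this sketch yields the same single byte (01000001, hex 41); the rows only diverge once the input contains non-ASCII characters.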

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).