To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??Þ???????????Þ?????????B 00111111001111111101111000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101111000111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3fde3f3f3f3f3f3f3f3f3f3f3fde3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲?????松??嚥≫?癲?????松??嚥≫?B 111000011001111100111111001111110011111100111111001111111000111110111100001111110011111110011010100010111000000111100010001111111110000110011111001111110011111100111111001111110011111110001111101111000011111100111111100110101000101110000001111000100011111101000010 e19f3f3f3f3f3f8fbc3f3f9a8b81e23fe19f3f3f3f3f3f8fbc3f3f9a8b81e23f42
EUC-JP 癲?Þ???松??嚥≫?癲?Þ???松??嚥≫?B 11100010101000010011111110001111101010011011000000111111001111110011111110111110101111100011111100111111110100111110101110100010111001000011111111100010101000010011111110001111101010011011000000111111001111110011111110111110101111100011111100111111110100111110101110100010111001000011111101000010 e2a13f8fa9b03f3f3fbebe3f3fd3eba2e43fe2a13f8fa9b03f3f3fbebe3f3fd3eba2e43f42
UTF-8 癲앸Þ璘뗧㎉松썬럹嚥≫늼癲앸Þ璘뗧㎉松썬럹嚥≫늼B 1110011110011001101100101110110010010101101110001100001110011110111011111010011110101111111010111001011110100111111000111000111010001001111001101001110110111110111011001000110110101100111010111001111110111001111001011001101010100101111000101000100110101011111010111000101010111100111001111001100110110010111011001001010110111000110000111001111011101111101001111010111111101011100101111010011111100011100011101000100111100110100111011011111011101100100011011010110011101011100111111011100111100101100110101010010111100010100010011010101111101011100010101011110001000010 e799b2ec95b8c39eefa7afeb97a7e38e89e69dbeec8daceb9fb9e59aa5e289abeb8abce799b2ec95b8c39eefa7afeb97a7e38e89e69dbeec8daceb9fb9e59aa5e289abeb8abc42
UHC 癲앸Þ璘뗧㎉松썬럹嚥≫늼癲앸Þ璘뗧㎉松썬럹嚥≫늼B 11101111101001101001110111101011101010001010110111101100110111101000101111100111101001111011101111100001111001101011110111100011100011101001100011100110101111111010000111101101100010001000010111101111101001101001110111101011101010001010110111101100110111101000101111100111101001111011101111100001111001101011110111100011100011101001100011100110101111111010000111101101100010001000010101000010 efa69deba8adecde8be7a7bbe1e6bde38e98e6bfa1ed8885efa69deba8adecde8be7a7bbe1e6bde38e98e6bfa1ed888542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)