Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????潮????腫????潮????腫^ 00111111001111110011111100111111100100101010101000111111001111110011111100111111100011101110111000111111001111110011111100111111100100101010101000111111001111110011111100111111100011101110111001011110 3f3f3f3f92aa3f3f3f3f8eee3f3f3f3f92aa3f3f3f3f8eee5e
EUC-JP ????潮????腫????潮????腫^ 00111111001111110011111100111111110001001010110000111111001111110011111100111111101111001111000000111111001111110011111100111111110001001010110000111111001111110011111100111111101111001111000001011110 3f3f3f3fc4ac3f3f3f3fbcf03f3f3f3fc4ac3f3f3f3fbcf05e
UTF-8 렯롊렯렡潮렯롔렯렮腫렯롊렯렡潮렯롔렯렮腫^ 11101011101000001010111111101011101000011000101011101011101000001010111111101011101000001010000111100110101111011010111011101011101000001010111111101011101000011001010011101011101000001010111111101011101000001010111011101000100001011010101111101011101000001010111111101011101000011000101011101011101000001010111111101011101000001010000111100110101111011010111011101011101000001010111111101011101000011001010011101011101000001010111111101011101000001010111011101000100001011010101101011110 eba0afeba18aeba0afeba0a1e6bdaeeba0afeba194eba0afeba0aee885abeba0afeba18aeba0afeba0a1e6bdaeeba0afeba194eba0afeba0aee885ab5e
UHC 렯롊렯렡潮렯롔렯렮腫렯롊렯렡潮렯롔렯렮腫^ 1000111010111100100011101101000010001110101111001000111010110010111100001100110110001110101111001000111011011000100011101011110010001110101110111111000011111110100011101011110010001110110100001000111010111100100011101011001011110000110011011000111010111100100011101101100010001110101111001000111010111011111100001111111001011110 8ebc8ed08ebc8eb2f0cd8ebc8ed88ebc8ebbf0fe8ebc8ed08ebc8eb2f0cd8ebc8ed88ebc8ebbf0fe5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
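
The rows in the table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. The codec names ("cp932" for SJIS-WIN, "cp949" for UHC, and so on) are assumptions about which Python codecs correspond to the charset labels, and the names CHARSETS and encode_row are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become b"?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "潮腫^"
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, shown, bits, hexa)
```

For example, encode_row("潮^", "cp932") yields the hex string 92aa5e, the same byte values the SJIS-WIN row shows for those two characters.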