What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??鷹??汚??議??柔ロ?沃 0011111100111111001111111110100011101000001111110011111110010001111010010011111100111111100010011001100000111111001111111000101101100011001111110011111110001111010111111000001110001101001111111001011110000000 3f3f3fe8e83f3f91e93f3f89983f3f8b633f3f8f5f838d3f9780
EUC-JP 艅??韋??鷹??汚??議??柔ロ?沃 10001111110101101111110100111111001111111111000011101010001111110011111111000010111010110011111100111111101100011111100000111111001111111011010111000100001111110011111110111101110000001010010111101101001111111100110111100000 8fd6fd3f3ff0ea3f3fc2eb3f3fb1f83f3fb5c43f3fbdc0a5ed3fcde0
UTF-8 艅덈퀩韋됧뵱鷹됱춷汚살늿議끿춯柔ロ닑沃 111010001000100110000101111010111000110110001000111011011000000010101001111010011001111110001011111010111001000010100111111010111011010110110001111010011011011110111001111010111001000010110001111011001011011010110111111001101011000110011010111011001000001010110100111010111000101010111111111010001010110110110000111010111000000110111111111011001011011010101111111001101001111110010100111000111000001110101101111010111000101110010001111001101011001010000011 e88985eb8d88ed80a9e99f8beb90a7ebb5b1e9b7b9eb90b1ecb6b7e6b19aec82b4eb8abfe8adb0eb81bfecb6afe69f94e383adeb8b91e6b283
UHC 艅덈퀩韋됧뵱鷹됱춷汚살늿議끿춯柔ロ닑沃 1110011010101001100010001110101110110011100111011110101011011111100010011110010110010100101011111110101111101101100010011110110010101101100100111110011111111101101110111110110010001000100010001110110010100001100001011110011110101101100011001110101011110101101010111110110110001000100101101110100010101010 e6a988ebb39deadf89e594afebed89ecad93e7fdbbec8888eca185e7ad8ceaf5abed8896e8aa

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
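
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's own implementation (which is not shown). The helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above, though individual rows may differ from the page's converter.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become b"?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, mirroring the table's columns.
    sample = "あ"
    for name, (bits, hex_string) in encode_table(sample).items():
        print(name, sample, bits, hex_string)
```

Using `errors="replace"` keeps the conversion from raising an exception on unsupported characters, at the cost of losing the original character in that charset's output.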