What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??怨封?鎧??疑?醍???疑?低頭? 001111110011111110001001100001011001010110010101001111111000101001011010001111110011111110001011010111100011111110010001111001110011111100111111001111111000101101011110001111111001001011100001100100111010101000111111 3f3f898595953f8a5a3f3f8b5e3f91e73f3f3f8b5e3f92e193aa3f
EUC-JP ??怨封?鎧??疑?醍???疑?低頭? 001111110011111110110001111001011100100111110101001111111011001110111011001111110011111110110101101111110011111111000010111010010011111100111111001111111011010110111111001111111100010011100011110001101010110000111111 3f3fb1e5c9f53fb3bb3f3fb5bf3fc2e93f3f3fb5bf3fc4e3c6ac3f
UTF-8 欌렪怨封렮鎧欌렪疑렑醍낯欌렪疑렑低頭떵 111001101010110010001100111010111010000010101010111001101000000010101000111001011011000010000001111010111010000010101110111010011000111010100111111001101010110010001100111010111010000010101010111001111001011010010001111010111010000010010001111010011000011010001101111010111000001010101111111001101010110010001100111010111010000010101010111001111001011010010001111010111010000010010001111001001011110110001110111010011010000010101101111010111001011010110101 e6ac8ceba0aae680a8e5b081eba0aee98ea7e6ac8ceba0aae79691eba091e9868deb82afe6ac8ceba0aae79691eba091e4bd8ee9a0adeb96b5
UHC 欌렪怨封렮鎧欌렪疑렑醍낯欌렪疑렑低頭떵 1110110111101011100011101011100011101010101100111101110011100110100011101011101111001011110100011110110111101011100011101011100011101011111101111000111010100110111100001011010110110011101110001110110111101011100011101011100011101011111101111000111010100110111011101011100011010100111010011011011010111010 edeb8eb8eab3dce68ebbcbd1edeb8eb8ebf78ea6f0b5b3b8edeb8eb8ebf78ea6eeb8d4e9b6ba

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
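
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page; it is a minimal illustration under some assumptions: the codec names `cp932` and `cp949` are taken as Python's nearest equivalents of SJIS-WIN and UHC, the helper name `encode_table` is hypothetical, and characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table above.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 approximates SJIS-WIN and cp949 approximates UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("低頭"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.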