To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???肉??袁??歪?;萸?????猥 00111111001111110011111110010011111101110011111100111111111001011100110100111111001111111001100001100011001111111000000101000111111001001100111000111111001111110011111100111111001111111110000011001110 3f3f3f93f73f3fe5cd3f3f98633f8147e4ce3f3f3f3f3fe0ce
EUC-JP ???肉??袁™?歪?;萸??洧??猥 0011111100111111001111111100011011111001001111110011111111101010110011111000111110100010111011110011111111001111110001000011111110100001101010001110100011010000001111110011111110001111110001111011010000111111001111111110000011010000 3f3f3fc6f93f3feacf8fa2ef3fcfc43fa1a8e8d03f3f8fc7b43f3fe0d0
UTF-8 麗몃쓹肉곈렟袁™끀歪묐;萸닸갭洧뺤뒠猥 111011111010011010001000111010111010101010000011111011001001001110111001111010001000001010001001111010101011001110001000111010111010000010011111111010001010001010000001111000101000010010100010111010111000000110000000111001101010110110101010111010111010110010010000111011111011110010011011111010001001000010111000111010111000101110111000111010101011000010101101111001101011010010100111111010111011101010100100111010111001001010100000111001111000110010100101 efa688ebaa83ec93b9e88289eab388eba09fe8a281e284a2eb8180e6adaaebac90efbc9be890b8eb8bb8eab0ade6b4a7ebbaa4eb92a0e78ca5
UHC 麗몃쓹肉곈렟袁™끀歪묐;萸닸갭洧뺤뒠猥 1110011010110000101110001110101110011101100101011110101110111111101100001110100110001110101100001110101010111110101000101110001010000101101101101110100011100000100100011110101110100011101110111110101110101101101101001110011010110000101110001110101011111011100101011110110010001010100111001110100011100101 e6b0b8eb9d95ebbfb0e98eb0eabea2e285b6e8e091eba3bbebadb4e6b0b8eafb95ec8a9ce8e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)