To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 玉??汚??節⑨???????岳??節??蘂 1000101111001010001111110011111110001001100110000011111100111111100100001101111110000111010010000011111100111111001111110011111100111111001111110011111110001010011110000011111100111111100100001101111100111111001111111110010101000001 8bca3f3f89983f3f90df87483f3f3f3f3f3f3f8a783f3f90df3f3fe541
EUC-JP 玉??汚??節??旿?????岳??節??蘂 101101101100110000111111001111111011000111111000001111110011111111000000111000010011111100111111100011111100000111110100001111110011111100111111001111110011111110110011110110010011111100111111110000001110000100111111001111111110100110100010 b6cc3f3fb1f83f3fc0e13f3f8fc1f43f3f3f3f3fb3d93f3fc0e13f3fe9a2
UTF-8 玉쀯숱汚꾡옅節⑨슬旿듿몚連얕웶岳띰쉔節썼쐥蘂 111001111000111010001001111011001000000010101111111011001000100010110001111001101011000110011010111010101011111010100001111011001001100010000101111001111010111110000000111000101001000110101000111011001000101010101100111001101001011110111111111010111001001110111111111010111010101010011010111011111010011010011010111011001001011010010101111011001001101110110110111001011011001010110011111010111001110110110000111011001000100110010100111001111010111110000000111011001000110110111100111011001001000010100101111010001001100010000010 e78e89ec80afec88b1e6b19aeabea1ec9885e7af80e291a8ec8aace697bfeb93bfebaa9aefa69aec9695ec9bb6e5b2b3eb9db0ec8994e7af80ec8dbcec90a5e89882
UHC 玉쀯숱汚꾡옅節⑨슬旿듿몚連얕웶岳띰쉔節썼쐥蘂 1110100010101100100101111110111110111101101000101110011111111101100001001110010010111111101101101110111110111101101010001110111110111101101111011110011111111010100010101110010110010001100010001110011011100110101111101110100010011111100001001110010010111111101101101110111110111101101010001110111110111101101111011110100010011100100010101110011111011110 e8ac97efbda2e7fd84e4bfb6efbda8efbdbde7fa8ae59188e6e6bee89f84e4bfb6efbda8efbdbde89c8ae7de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)