To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 輿??衣??怨??要?????濡ル?筌 1001011101100000001111110011111110001000110111110011111100111111100010011000010100111111001111111001011101110110001111110011111100111111001111110011111110010100010001111000001110001011001111111110001010100011 97603f3f88df3f3f89853f3f97763f3f3f3f3f9447838b3fe2a3
EUC-JP 輿??衣??怨??要??彛??濡ル?筌 11001101110000010011111100111111101100001110000100111111001111111011000111100101001111110011111111001101110101110011111100111111100011111011110011111010001111110011111111000111101010001010010111101011001111111110010010100101 cdc13f3fb0e13f3fb1e53f3fcdd73f3f8fbcfa3f3fc7a8a5eb3fe4a5
UTF-8 輿삘넃衣쏙쭓怨뺤젞要쏄맏彛낉쭓濡ル솊筌 111010001011110010111111111011001000001010011000111010111000010010000011111010001010000110100011111011001000111110011001111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010011110111010001010011010000001111011001000111110000100111010111010011110001111111001011011110110011011111010111000001010001001111011001010110110010011111001101011111110100001111000111000001110101011111011001000011010001010111001111010110110001100 e8bcbfec8298eb8483e8a1a3ec8f99ecad93e680a8ebbaa4eca09ee8a681ec8f84eba78fe5bd9beb8289ecad93e6bfa1e383abec868ae7ad8c
UHC 輿삘넃衣쏙쭓怨뺤젞要쏄맏彛낉쭓濡ル솊筌 1110011010101011101110111110001010000110100100111110101111111101101111011110111110100111100010111110101010110011100101011110110010100000100110001110100110101001100110111110101010111000101110101110110010101101100001011110111110100111100010111110101110100001101010111110101110011001100011101110111110100111 e6abbbe28693ebfdbdefa78beab395eca098e9a99beab8baecad85efa78beba1abeb998eefa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)