Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????^ | 00111111001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN | ???幽??乙??^ | 001111110011111100111111100101110100100000111111001111111000100110110011001111110011111101011110 | 3f3f3f97483f3f89b33f3f5e
EUC-JP | ???幽??乙??^ | 001111110011111100111111110011011010100100111111001111111011001010110101001111110011111101011110 | 3f3f3fcda93f3fb2b53f3f5e
UTF-8 | 黎싳궡幽경콢乙대뜲^ | 11101111101001101000100111101100100010111011001111101010101101101010000111100101101110011011110111101010101100101011110111101100101111011010001011100100101110011001100111101011100011001000000011101011100111001011001001011110 | efa689ec8bb3eab6a1e5b9bdeab2bdecbda2e4b999eb8c80eb9cb25e
UHC | 黎싳궡幽경콢乙대뜲^ | 11100110101100011001101011101100100000101011010011101010111010111011000011100110101100011001101011101011111000001011010011101011100011011011000001011110 | e6b19aec82b4eaebb0e6b19aebe0b4eb8db05e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
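
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page. The mapping from the table's charset names to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent show up in the table as "?" (hex 3f); errors="replace" is used here to imitate that behavior.

```python
# Assumed mapping from the table's charset names to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters become b"?" (0x3f), as in the table above.
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("幽^").items():
        print(name, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.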