To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畏??詭厄??異??楡??豫??由?┸ 100010001101100000111111001111111110011001101011100101101110111100111111001111111000100011011001001111110011111110011110101111100011111100111111100110001010110000111111001111111001011101010010001111111000010010111101 88d83f3fe66b96ef3f3f88d93f3f9ebe3f3f98ac3f3f97523f84bd
EUC-JP 畏??詭厄??異??楡??豫??由?┸ 101100001101101000111111001111111110101111001100110011001111000100111111001111111011000011011011001111110011111111011100110000000011111100111111110100001010111000111111001111111100110110110011001111111010100010111111 b0da3f3febccccf13f3fb0db3f3fdcc03f3fd0ae3f3fcdb33fa8bf
UTF-8 畏븐떓詭厄댁떔異양챻楡녹뒋豫뗭엻由뱄┸ 111001111001010110001111111010111011100010010000111010111001011010010011111010001010100110101101111001011000111010000100111010111000110010000001111010111001011010010100111001111001010110110000111011001001011010010001111011001011000110111011111001101010010110100001111010111000010110111001111010111001001010001011111010001011000110101011111010111001011110101101111011001001011110111011111001111001010010110001111010111011000110000100111000101001010010111000 e7958febb890eb9693e8a9ade58e84eb8c81eb9694e795b0ec9691ecb1bbe6a5a1eb85b9eb928be8b1abeb97adec97bbe794b1ebb184e294b8
UHC 畏븐떓詭厄댁떔異양챻楡녹뒋豫뗭엻由뱄┸ 1110100011100110101110101110110010001011101010011100111111111000111001001111100010110100111011001000101110101010111011001011011010111110111001111010101010001000111010101111100010110011111011001000101010001000111001111110001110001011111011001001111010001101111010111010011010111001111011111010011010111111 e8e6baec8ba9cff8e4f8b4ec8baaecb6bee7aa88eaf8b3ec8a88e7e38bec9e8deba6b9efa6bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)